Gene Htur_4177 details

Gene Information

Locus tag: Htur_4177
Symbol:
ID: 8744805
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013744
Strand:
Start bp: 445711
End bp: 447351
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 61%
IMG OID: 646514725
Product: AMP-dependent synthetase and ligase
Protein accession: YP_003405672
Protein GI: 284167394
COG category: [I] Lipid transport and metabolism
COG ID: [COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases
TIGRFAM ID: [TIGR02262] benzoate-CoA ligase family
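
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the record above.
start_bp = 445711
end_bp = 447351

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with a stop codon,
# which is not part of the translated protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 1641 bp
print(protein_length, "aa")  # 546 aa
```

Both results agree with the Gene Length and Protein Length fields of the record.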


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTATGC AGGCCACTAT CACAGAGGAG CGTCTGCCGG ACGAAGAGAA TGCCCCGGAG 
TACGTCCATT CGCTTCCGGA GCTCCATTAC CCGAACCAGA TCAACGTCGT CGACGAACTG
GTCGACCGCC ACGTTCGCGA AGGGCGCGGT GACAACGTCG CGATCTATTT CGAGGATCGG
GAGATCACGT ACGAAGAATT GCAGGAGACC GTGAACAGAA TGGGGAATGC GCTGCGCGGC
CTCGGCGTCG AAGCCGGTGA TCGGGTCGTC GTTCGATTCC CTAACCGACC GGAAGCGATT
ATCTCGTGTC TGGCGGTTCA GAAGATCGGC GGCGTTGCTC TCCCGTCGAT GAAACTCCTC
CGGGCGAAAG AGCTCGAACA CATCGTCAAC GACGCCGAAG CGTCGGCCAT CGTCGTCTAC
GATGATCTCC TCAGAGAAGT GGAGAACGCG CTGCCGGAAC TCGAAACCGT CGACGACGTC
ATCGTCGCCG AGCGCAACGG GATCGATCAC AGCTATCACA GCTACGACGG CCTACTCGAG
GACGCCAGTG ACGAACTCGA GGCGTACAAC ACCGAACGCG ACGATCTCGC GCTGATGCTG
TATACGAGCG GAACGACCGG GCAACCGAAG GGTGCGATTC ATACCCACCG GAACATGTTG
GCCACCGCGG ACTCCTACGC GCGGTACTGC CTCGAGCCGA CCGAGAACGA CGTCTTCGGC
GGGAATCCAC CGCTCCCCTT CGCGTACGGA TACGGCGACC TCGTGACGTT CCCGCTCCGG
TTCGGCGCGA GTACGAGTCT CGTCGAAGAC GCGGATCCCG GCGATTTACT GGAAGCGATC
GAGGCCCACG GGGTTTCGAT CCTCTGTTCG ATCCCGACGG GATTCAATCA GATCCTCTCC
CAGTATCCCG ACGGTCCTGA CGACTACGAC GTCTCATCGC TCCGTCTCGG GCTCAGCGCC
GGCGAACCGC TGACGCCGAC GACGTTCGAG GAATTTAAAT CCGAGTACGG AATCGATCTC
CTCGACGGAA TTGGAACGAC GGAGATGCTC CATATCTTCA TCAGTCACCG TCACGACGAG
GAAATCGATC CGAGCGCGAC CGGGTATCCG GTCCCGGGGT ACGAGTGTAA GATCATTGAC
CCGGACACGG GCGAAGACTT AGAGCGCGGC GAAGCGGGAC TCCTCGCCGT TCGCGGGCCG
ACCGGAATCG AATACTGGGA CCGACCCGAG AAACAACTCG AGGTAAACCA GGACGGGTGG
TCGATCCCGG GCGACATCTT TGTACAGCGC GAAGACGGTC GCCTGGAGTA CAAATCCCGC
GACGACGACC TCATCATCTC GAGCGGCTAC AACATCCCCG GGCCCGAGGT CGAAGCGGTC
ATCGAAGAAC ACGAATCGGT CTCCGAGGTC GCTGTCGTCG GCAGTCCGGA CGAGCAACGC
GGCGAGGTCG TGAAAGCGTT CGTCGTCCTG AACGACGGCG CCTCCGAGGG AGACGAACTC
GTGACGGAAA TACAGAATCA CGTCAAGAAC AACCTCGCAC CGTATAAGTA TCCGCGCGAG
GTCGAGTTCA AGAACGCCCT CCCGCGGACG GATACCGGAA AGATTCGACG GACGGAACTC
CGACAGTTGG AGCGCCGATA G
 
Protein sequence
MRMQATITEE RLPDEENAPE YVHSLPELHY PNQINVVDEL VDRHVREGRG DNVAIYFEDR 
EITYEELQET VNRMGNALRG LGVEAGDRVV VRFPNRPEAI ISCLAVQKIG GVALPSMKLL
RAKELEHIVN DAEASAIVVY DDLLREVENA LPELETVDDV IVAERNGIDH SYHSYDGLLE
DASDELEAYN TERDDLALML YTSGTTGQPK GAIHTHRNML ATADSYARYC LEPTENDVFG
GNPPLPFAYG YGDLVTFPLR FGASTSLVED ADPGDLLEAI EAHGVSILCS IPTGFNQILS
QYPDGPDDYD VSSLRLGLSA GEPLTPTTFE EFKSEYGIDL LDGIGTTEML HIFISHRHDE
EIDPSATGYP VPGYECKIID PDTGEDLERG EAGLLAVRGP TGIEYWDRPE KQLEVNQDGW
SIPGDIFVQR EDGRLEYKSR DDDLIISSGY NIPGPEVEAV IEEHESVSEV AVVGSPDEQR
GEVVKAFVVL NDGASEGDEL VTEIQNHVKN NLAPYKYPRE VEFKNALPRT DTGKIRRTEL
RQLERR
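
The protein sequence corresponds to the gene sequence read codon by codon. As a minimal sketch, the snippet below translates the first and last blocks of the gene sequence and reproduces the start and end of the protein. It builds the codon table from the usual TCAG ordering of the standard genetic code; translation table 11 assigns the same amino acids to codons as the standard table and differs only in which start codons it permits, which this sketch does not model. The function name `translate` and the constants are illustrative choices, not part of the record.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, listed in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence above.
first_block = "ATGCGTATGC AGGCCACTAT CACAGAGGAG"
last_block = "CGACAGTTGG AGCGCCGATA G"

print(translate(first_block))  # MRMQATITEE
print(translate(last_block))   # RQLERR
```

The last block is in frame because the 1620 bases preceding it are a multiple of three. Its final codon, TAG, is a stop codon, which is why the 1641 bp gene yields 546 residues rather than 547.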