Gene Htur_3692 details


Gene Information

Locus tag: Htur_3692
Symbol:
ID: 8744318
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 3804308
End bp: 3805888
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 63%
IMG OID: 646514279
Product: extracellular solute-binding protein family 5
Protein accession: YP_003405227
Protein GI: 284166948
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
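
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 3804308
END_BP = 3805888


def gene_length(start: int, end: int) -> int:
    """Number of bases spanned by an inclusive coordinate pair."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: one residue per codon,
    minus the terminal stop codon (assumed to be present)."""
    codons, remainder = divmod(gene_len, 3)
    if remainder:
        raise ValueError("gene length is not a multiple of 3")
    return codons - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1581
print(protein_length(length_bp))  # 526
```

Both values agree with the Gene Length and Protein Length fields in the record.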


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGGTGA ACCAGTCAAC GACGCGGCGG CACCTCCTCG CTTCGGGGGC CGCGATTTCC 
GCGTCAGTCA TCGCAGGGTG CATCGGCGGC GGCGGTGGCG GCGACGGGAA AGCCTTCCGC
TTCACGCAAG AGCAGTCGCG AGAGGAGCAG TTCGATCCCG TCGTCTCGAA CGACGCGTAC
AGCTTTCAGG TGATTCAGCT CGTCTTCGAC GGGCTCTACG AGTACAGCGA AGGACTCGAG
CTCCAGCCGA AACTCGCGAC GGGCGAGCCG ACCGTCGAGC GCGACGGCAC GCGGTACATC
TTCGAGATCG TAGAGGGTGC CACGTTCCAC AACGGAAACG AAGTGACCGC CGGAGATGTC
GCGCACTCCT TTACCGCGCC GGTCGAGGAG GAGACGGAGA ACGCCTCGGA GTACGACATG
ATCGAGAGCA CCGAGGTCAT CGACGACTAT CAGCTCCAGG TCGACCTCGG GGAGGATCCG
TACGGCCCGT TCGAACTCGC GACGATGGGC GTGACGGTGG TCCCAGAGAG CGCTCGAACC
GAGGACCGCG AGGCGTTCAA CACGAATCCG ATCGGCTCGG GACCGTTCAC CTTCGCCGAA
CTCCAAGAGA ACGAGTACGT CGAGATCGAA CGCAACGACG ACTACTGGGA CGACCTCGAG
CCGAACCTCC AGCGGGTCCG CTTCGAAGCT CACGACGACA ACGCGGGTCG CGTCTCCGAC
ATCCGGTCGG GGAACACCGA CGCCATCGCC GGCGTGCCCA ACGAGGACTG GAGCGTCCTC
GAGAACGAGG AGAACGTCAC TCTTCATTCG GCGGAGAGTC CGACGTTCAT GTACATGGCG
TTTAATTGTA ACGAGGGGCC GACAACGAGT CCCGAAGTGC GGCGAGCGAT CGCCCACTCG
TTCTCGATGT CGGACTTCAT CGAGTCGAAC GCCGCGAACG TGGCGTCGCC GATGTACAGT
CCGATCCCGC CCGTCGTCAA TGAGGTCTGG GGCTTCCCCG AGGACGAGTA TCAGGAACTG
TTGCCGTCGT ACGACCCCGA GGAGGCGCAG TCGCTACTCG ACGAACACGC GCCCGACGGC
TTCACGCCGA CGATCATCAC GCCGGAGGGA ATCCGCGCCC AGTTAGCCGA ACGGATCGCG
ACTCGGTTGG ACGAGATCGG GTACGGTGCG GACGTACAGG TACTGGACTT CGCGACGCTG
GTCGACACCT ACACCAGCGG AAGCGCCGAC GACTACCAGA TGTACCTGCT GGGCTGGACC
GGCGGTCCCG ATCCGGACTA CTACCTCTAC CCGCTGTTCC ACGAGAGTCA GGCGGGAACG
AATCAGGGCC ACTTCTACGG CGGGAGCGAC GGGTTCCACG AGGCGATCGC CGAGGGACGC
AACTCCGCTG GACAGGAGGA GCGCTACGAC ATTTACGAGC CCGTCATCCG AGAGATCGTC
GAACAACTGC CTGCCCTCCC GGCGTTCACG CAGGACAACA CGATGGCCTC GCGCAACTAC
GTCCAGGACC TGCAGGCACA CCCGGAAGTG ACGCGGAACC CGACGCTCGT CGCAGAGTAT
ACGAACGTAT CGATGGAGTG A
 
Protein sequence
MVVNQSTTRR HLLASGAAIS ASVIAGCIGG GGGGDGKAFR FTQEQSREEQ FDPVVSNDAY 
SFQVIQLVFD GLYEYSEGLE LQPKLATGEP TVERDGTRYI FEIVEGATFH NGNEVTAGDV
AHSFTAPVEE ETENASEYDM IESTEVIDDY QLQVDLGEDP YGPFELATMG VTVVPESART
EDREAFNTNP IGSGPFTFAE LQENEYVEIE RNDDYWDDLE PNLQRVRFEA HDDNAGRVSD
IRSGNTDAIA GVPNEDWSVL ENEENVTLHS AESPTFMYMA FNCNEGPTTS PEVRRAIAHS
FSMSDFIESN AANVASPMYS PIPPVVNEVW GFPEDEYQEL LPSYDPEEAQ SLLDEHAPDG
FTPTIITPEG IRAQLAERIA TRLDEIGYGA DVQVLDFATL VDTYTSGSAD DYQMYLLGWT
GGPDPDYYLY PLFHESQAGT NQGHFYGGSD GFHEAIAEGR NSAGQEERYD IYEPVIREIV
EQLPALPAFT QDNTMASRNY VQDLQAHPEV TRNPTLVAEY TNVSME
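
The gene and protein sequences above can be related with a short script. This is a minimal sketch, not the database's own pipeline: it builds the codon table by hand in the usual NCBI ordering (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons), and the helper names `clean`, `translate`, and `gc_content` are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above.
first_line = "ATGGTGGTGA ACCAGTCAAC GACGCGGCGG CACCTCCTCG CTTCGGGGGC CGCGATTTCC"
print(translate(first_line))  # MVVNQSTTRRHLLASGAAIS
```

The printed residues match the first twenty positions of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the Protein Length and GC content fields to be checked in the same way.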