Gene Htur_1045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_1045 
Symbol 
ID8741632 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp1081275 
End bp1083098 
Gene Length1824 bp 
Protein Length607 aa 
Translation table11 
GC content65% 
IMG OID646511623 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003402610 
Protein GI284164331 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTGTA ATCCAACCGA TCCTGTCGAC GGCGTCGATC GACGCTCCGT CCTGGCTGCG 
GGCGCAGCCG GGCTCTCGCT CTCCCTGAGC GGCTGTGTCG ATACGGTCCA ACGCGTCGTC
GACCAGGACG GTACCGACCA ATTGTCGCTC TCCATCGTGA CGGTCCCCGC GGACGCCGAC
CGGGAAAGCA TTCGGATCGC CTATCACCTC GAGTCGCATC TCGAGGCGAT CGGCGTGGAC
GTGACCCTCA AGGTGCGCTC TCGTACCGAC TTCCTCAAAA CGGTCCTGAT CGACCACGAC
TTCGATATCT ACGTCGGTCA GCATCCCGCC GATTACGATC CGGACTTCCT CTACGAAGCC
CTCCACTCCA CGTACGCAAA CGAGGCCGGC TGGCAGAACC CCTTCGGGTT CGACAGCATG
GCCTTCGACA CCCTCCTGGA GGATCAACGC CGGGCCGACG GTGAGCAGCG CAAGCAACGC
TTAGCAAATG TACTGCGCGG AGTTGCAGAC GAGAAACCGT TCGATCCGAT CTGTCGTCCC
GACGAGATCC GGGTCGCTAA CACCACCCGC TTCGACGGTT GGGATCAGGG TCACCTCGCG
ACGCGACGCG GCTATCTCGG CCTCGAGCCG GACGCCGGCG TCGAACGATT GAACGCGCTC
GTGACCGACG CCAGACCGTC GGTCAACGTC AACCCGATCT CGGCGACGGT CCGGGAACGG
GGGACGGTCG TCGACCTGCT GTACGACTCG CTCGGGACCG TCGTCGACGG CGAGGTCCTG
CCGTGGCTCG CGGAGTCGTG GGAGTGGGTG ACCGATGCCG AGACCGACGA GAAGAAAACC
GAAGAGATCG CCGAGCCGAC CCGAAACACG ACGACGGCGC GGGTTTCGCT CCGAGAGGAC
TGTCGGTTCC ACGACGGCGA ACCGGTTACC GCAGCGGACG TCGAGTTCAC CTATCAGTTC
TTCCAGGATA CCGTGCTCGG TCACGCGACG CCGTCGCCGC CGCCCCGCTA TCGCGGTCAC
GCGAGCGCGA TCGACGATAT CGAGATCGAG GACGAGTACA CGCTGCGGAT CACCGCCGCC
GCCGGCACGG ACGTCTGTGA ACGCGCGTTT ACCGCCCCGA TCCTCCCGAA ACACGTCTGG
CAATCGGAAC TCGAGGACCG GCTCGGTAAC TCTCAGGAGT TTTCGGCGCC GCAAGGATCC
TGGAGTTTGG TCACCAGCGA CTCGATCGAT CCGACCGGGA GCGGTCCCTA CCAGTTCAAG
AACAACTCCG AACGGGAACA CCTCACGCTC GAGCGGTTCG ACGATCACTT CACGCTGCGC
GAGGACGCCG CAGGCGACCA CCTCCTCGCT CCACGCGTCG AGGAGCTCCG GTTTATCGTC
GATCCCGGCA GCCCGTCCTC CATCTCGCGG GTCGCCAGCG GCAACGCCGA CCTCACCTCG
TCGATGCTCG CGGCGTACTC GCTCAGCGAC ATTCCAGAGG ACAACCCCGA CGTCGAACGG
CTCGAGTCGC CCTCGTGGAC GTTTTACCAC CTCGGGTTCA ACACGCGGGC GCCGCCGTGT
AGCAACCTCC ACTTCCGGCG CGCGATCTGT CGACTGATCG ACAAGGAGTG GATCGCGAGC
GAAGTCTTCG GCGGCCACGC GGACCCGCTC GTGGCTCCGG TGACCGAGGA GTGGACGCCC
GACGACCTCG CGTGGGACGG TGCGGACCCA GAAACGCCAT TTGCCGGCAC CGACGGGACG
CTCAACGTCA ACGCGGCGCG TAACGCGTTT CAGGCGGCCG GCTACTACAC CGACGACGAG
AACCGACTAC AGGGGCGATA CTGA
 
Protein sequence
MNCNPTDPVD GVDRRSVLAA GAAGLSLSLS GCVDTVQRVV DQDGTDQLSL SIVTVPADAD 
RESIRIAYHL ESHLEAIGVD VTLKVRSRTD FLKTVLIDHD FDIYVGQHPA DYDPDFLYEA
LHSTYANEAG WQNPFGFDSM AFDTLLEDQR RADGEQRKQR LANVLRGVAD EKPFDPICRP
DEIRVANTTR FDGWDQGHLA TRRGYLGLEP DAGVERLNAL VTDARPSVNV NPISATVRER
GTVVDLLYDS LGTVVDGEVL PWLAESWEWV TDAETDEKKT EEIAEPTRNT TTARVSLRED
CRFHDGEPVT AADVEFTYQF FQDTVLGHAT PSPPPRYRGH ASAIDDIEIE DEYTLRITAA
AGTDVCERAF TAPILPKHVW QSELEDRLGN SQEFSAPQGS WSLVTSDSID PTGSGPYQFK
NNSEREHLTL ERFDDHFTLR EDAAGDHLLA PRVEELRFIV DPGSPSSISR VASGNADLTS
SMLAAYSLSD IPEDNPDVER LESPSWTFYH LGFNTRAPPC SNLHFRRAIC RLIDKEWIAS
EVFGGHADPL VAPVTEEWTP DDLAWDGADP ETPFAGTDGT LNVNAARNAF QAAGYYTDDE
NRLQGRY