Gene Haur_2657 details

Gene Information

Locus tag: Haur_2657
Symbol:
ID: 5734552
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3410085
End bp: 3411938
Gene Length: 1854 bp
Protein Length: 617 aa
Translation table: 11
GC content: 50%
IMG OID: 641279799
Product: extracellular solute-binding protein
Protein accession: YP_001545423
Protein GI: 159899176
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
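
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3410085
end_bp = 3411938
gene_length_bp = 1854
protein_length_aa = 617


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in an unspliced CDS, minus the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# 3411938 - 3410085 + 1 = 1854 bp; 1854 / 3 = 618 codons; 618 - 1 = 617 aa
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Both checks pass for the values listed here, so the start, end, gene length, and protein length fields are mutually consistent.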


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000297428
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCGAAC GAAGCAAACG CTTAGCCGCC ATCGTGCTTA CTGGCGCGAT TTTGGCAGCA 
TGCGGTAGCG GCACCAGCAC CACACAACCA ACCACTGGCA GCAGCGAGGC AACCGCCGTC
GTTGAGCCAG GTACTGATAC TGGGTCGCAA CCCAGCAGCG ATGGCATTGT AATCATTGGG
ATGAGCCAAT CGCCAGACAC CTTATTTGGG ATGGAATCGC AATCGAGCGC CACCACCCAA
GTGTTAAACT CAGTGCAACC AGCCTGTGTA ACGACCCTCA GCTACGATTA TCAACCAGTC
TGTTTCGCTG AATTGCCCAC CTTTGAAAAT GGCGGCGCAG TCGAAGAAAT GGTAACAGTT
GATCAAAGCT ATACCGGCCC ATTCGTGATC GAGAATGAAC TGATCACTGA TACCAGCGTT
TTGACTGGGC CAATCGAATT GCCCCAAGTT AAAGTGACAT GGAAGTTGAT CGATGGCATT
ACTTGGGAAG ATGGCACACC AGTAACCGCC GATGACTTTT TGTTTGCTGC TGAATTGTAT
GCTAACCCAG GCACCAAAGT CGCCAGCCGC TTCACCATCG AGCACACCGC CAAATACGAA
AAAGTTGATG ATCAAACTTT CTCATGGTAT GGCGTGCCAG GCTATCAAGA TTCAACCTAT
TTCTTGAACT ACGCAGGTGG CGCACTTGGC CCTGAACCCA AGCATGTGCT CGGCAGCGTC
GATGCAGCAA CCATCGGCGG TAGCGATTAT TCCAGCAAAC CGTTAGCTTA TGGCCCGTAC
AAAATTGACG AATATGTGCC GCAAGAACGG GTGACAATGA GTGCTAATCC ACATTATTGG
GGAGCAAAGC AAGGCCTACC CAAAATTAGC AATGTAATCT ATAAATTCGT CACCAGCGAA
GATCAAATTT TGCAACAATT GCAATCAGGC GAAATCGATG TCGTTGGCCA AATTGGCTTA
TCGCTAGCTC AAGCGCCATC ACTTGATGAA TTGCAAGCAG GCGCTGAATT CGATATTCAA
TATGTGCCAG CCACGGTTTG GGAACATATT GACTTTGCTG TTGAGCGCGG CGATGGCGTT
GCAACCCCAT TTGCTGATGC CAAAGTGCGT CAAGCAGTCG CCTATGCAAT CAACCGCCAA
GAAATCATCG ATCAGATCTT GTTTGGTAAA ACCGTCGCAA TGAACAGCTT CATGCCCGAT
GATCACTGGG CCTATCCCAG CGATGCCAGC GTGATCAACT CTTATGCCTT TGATCCCAGC
AAAGCGATTC AATTGCTGAA TGAAGCTGGT TGGGTTGCTG GCGATGATGG CATTTTGGTC
AAAGATGGCG AACCATTCAA GGTTGAATTC TTCACCACCG AAGGCAACGA TACCCGTCAA
GCAGTCGCCC AATTGGTGCA AGAATACTTG CGCGATGTCG GGATCGACGT TGAATTAAAG
TTTGTCGCTG GCACTGATGT GCTGTTCAAA AATGGCTCGG AAGGTATTTT GGCAGGTCGC
CGCTACGATA TGGCATTGTA TGCTTGGGTC AGTGGCCCTG AGCCATCAAC GCCGTTGTAT
CTGTGCAGCC AAGTGCCAAC AGCAGCCAAT GGCTACGCTG GCCAAAACAA CACTGGCTAC
TGCAACCCCG ATTACGATAA AGTAGCGCTC GAAAGCCAAA GCATCATCGA ACGGGCTGGC
CGCTTGCCAT TGCTTAGCCA AGCCCAACAA ATCTTCAACC GCGATTTGCC AACCTTGCCG
TTATATCAAC GGATCAACGT TGGGGCTGCC CGCAAAACAA TCAGCGGCTT CAAGCTCGAT
CCAACCAGCC AACAAGATTT CTACAACATC GAAACCTGGG AATTAGCCCA ATAA
 
Protein sequence
MFERSKRLAA IVLTGAILAA CGSGTSTTQP TTGSSEATAV VEPGTDTGSQ PSSDGIVIIG 
MSQSPDTLFG MESQSSATTQ VLNSVQPACV TTLSYDYQPV CFAELPTFEN GGAVEEMVTV
DQSYTGPFVI ENELITDTSV LTGPIELPQV KVTWKLIDGI TWEDGTPVTA DDFLFAAELY
ANPGTKVASR FTIEHTAKYE KVDDQTFSWY GVPGYQDSTY FLNYAGGALG PEPKHVLGSV
DAATIGGSDY SSKPLAYGPY KIDEYVPQER VTMSANPHYW GAKQGLPKIS NVIYKFVTSE
DQILQQLQSG EIDVVGQIGL SLAQAPSLDE LQAGAEFDIQ YVPATVWEHI DFAVERGDGV
ATPFADAKVR QAVAYAINRQ EIIDQILFGK TVAMNSFMPD DHWAYPSDAS VINSYAFDPS
KAIQLLNEAG WVAGDDGILV KDGEPFKVEF FTTEGNDTRQ AVAQLVQEYL RDVGIDVELK
FVAGTDVLFK NGSEGILAGR RYDMALYAWV SGPEPSTPLY LCSQVPTAAN GYAGQNNTGY
CNPDYDKVAL ESQSIIERAG RLPLLSQAQQ IFNRDLPTLP LYQRINVGAA RKTISGFKLD
PTSQQDFYNI ETWELAQ
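
The protein sequence can be reproduced from the gene sequence by stripping the spaces and line breaks from the 10-base blocks and translating codon by codon. The sketch below is a minimal illustration, not the database's own pipeline. It uses the standard codon assignments, which translation table 11 shares for every codon; table 11 differs only in allowing extra start codons, and that does not matter here because this gene begins with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses these same assignments for all 64 codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
first_blocks = "ATGTTCGAAC GAAGCAAACG CTTAGCCGCC"
print(translate(first_blocks))  # MFERSKRLAA
```

The printed fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content.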