Gene Haur_3496 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3496 
Symbol 
ID5735357 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4402834 
End bp4404606 
Gene Length1773 bp 
Protein Length590 aa 
Translation table11 
GC content50% 
IMG OID641280643 
Productextracellular solute-binding protein 
Protein accessionYP_001546260 
Protein GI159900013 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTGACC TTGGATTGAT GCTCATGAAA TTGCGTTTTA GCCTTGCGCT GTTGCTCTGT 
TTTAGCTTGA TTGGTTGTAG TATTGAGGGT GGTGCACCAG CTGGTCAGCC CACTGTCCAA
GTGCCAACTC CCTTGCCAAC GACCAACCCT GCCACTTCCA ATCTTTGGAC GTTGGGCTTA
ACGGAGGAGC CAGTTGATCT GTATCCCTAC AGTTATGCCT TTCGTGGCGC AGCTCCATTG
ATTGAATTGC TTTACCCTGC GCCTTTGACG GTGGTTGCTG AAGCCTATAC CACAACTGGC
GTTTTGGAGC GCGTGCCATC ATTCGAAAAT GGTGATGTGC AATATATTAC AACCACGGTT
AATCTTGATG CGAATGGCGT AATTACCACC ACGCAAACCG AAACGATTAC CAGCGTGCGT
CAGTTGACCG TCACCTATCG CTGGAATAAA AATTTGCAAT GGGAAGATGG CCAACCATTA
ACCGCAGCTG ATTCAGTTTT TGCCTATAAT TTGTTGCGTG GTAGCGCCTC AACCGCCCAA
TTGGCAACCC AAAGCGACTT AACCGCCGAT TATGTGGCGA TTGATCAATA TACGACTCGT
GCTTATTTGC CGCCCGAACG CGACGACCCA AATTATTTGG TGACGGTCTG GACTCCATTG
CCAGCGCATT TGTTTGAAGG TCAGCCAAGT GCCAAAGAAG TTAGCGATCG TTTGGCCCAG
TCGCCAGTTG GCTATGGCCC GTATACGCTC AAAGCCTGGA CGGCAGGCAC GCAGTTGGAA
TTTGTGCGAC GCGAAGGTCA AACGGAACAA TTGCCTAGCA CGATTATTGC CCGTTTATAT
CCTGATATTG CCATGATGCG CGACGATGTG CTGAGCGGGC GGGTCGATGT GGCTTGGACA
GAAGGTTCGT TGGAGCAATT AGCGCTTGAT CTGAAAACTG ATGTGCAATC CAAAACCTTG
CAATTGCTCC AAGCTCCCAA CCCAATTTGG GAACATATCG ACATGAATTT GGCGGTTGTA
GCTTTGCAAG ATATTCGGAT GCGCCAAGCG ATTGCCCATG GTTTTGACCG TGAGGCGATT
AGCACAACCT TGTATGGCAC GCCCAAAGCA GTTTTGCATA GTTGGTTGGC GGCTGAATCA
TGGGCTTTTG ATCCAACAAC AGTGGTCAGT TATACCTTTG ATCCTGCGCT TTCACGCCAA
TTGCTTGATG AAATGAATTA TCGTGATACC AATGATGATG GTTTGCGCGA ACGCCCTGAT
GGTACGCCCT TCCAATTGAC ATTAACCACT TCGGCTCAAA CCCCGATTCG CCAACGCCTG
AGTGAGCAAT TTGTCAGCGA TATGCAGGCA ATCGGGATTG ATATTAAGGT TGAAGCCTTA
TCAACCACCG ATTTGTATAG CCAGCAAGGG CCATTATTTG GGCGACGCTT TGAATTGGCC
TTGTTTGGCT GGCTGCGCAG CGTTGATCCC GATGGCGCGG TGTTGTGGAG TTGTGCCGCG
ATTCCTAACC AAATTAACGG CTACTCCGGC GATAATTTTA CTGGTTGGTG TATGGATACC
GCCGATCGGG CGATTCGTAC CGCGACCAGT TCGCTTGATC CGGCGGTGCG TAAGGCTGCC
TATAGCGAAC AACAGCAAAT TTTCACCCGC GAATTGCCAG TCTTGCCCGT GATTACCCGC
CAAACCACGG TGTTGCTTGC GCCGAATGTG CGAGGTGTGC AACCGCAACC ATTGGCTCCA
ATTACTTGGA ATGTGGGTGC TTGGCAACGC TAG
 
Protein sequence
MLDLGLMLMK LRFSLALLLC FSLIGCSIEG GAPAGQPTVQ VPTPLPTTNP ATSNLWTLGL 
TEEPVDLYPY SYAFRGAAPL IELLYPAPLT VVAEAYTTTG VLERVPSFEN GDVQYITTTV
NLDANGVITT TQTETITSVR QLTVTYRWNK NLQWEDGQPL TAADSVFAYN LLRGSASTAQ
LATQSDLTAD YVAIDQYTTR AYLPPERDDP NYLVTVWTPL PAHLFEGQPS AKEVSDRLAQ
SPVGYGPYTL KAWTAGTQLE FVRREGQTEQ LPSTIIARLY PDIAMMRDDV LSGRVDVAWT
EGSLEQLALD LKTDVQSKTL QLLQAPNPIW EHIDMNLAVV ALQDIRMRQA IAHGFDREAI
STTLYGTPKA VLHSWLAAES WAFDPTTVVS YTFDPALSRQ LLDEMNYRDT NDDGLRERPD
GTPFQLTLTT SAQTPIRQRL SEQFVSDMQA IGIDIKVEAL STTDLYSQQG PLFGRRFELA
LFGWLRSVDP DGAVLWSCAA IPNQINGYSG DNFTGWCMDT ADRAIRTATS SLDPAVRKAA
YSEQQQIFTR ELPVLPVITR QTTVLLAPNV RGVQPQPLAP ITWNVGAWQR