Gene Haur_1605 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1605 
Symbol 
ID5733507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1862688 
End bp1864475 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content52% 
IMG OID641278744 
Productextracellular solute-binding protein 
Protein accessionYP_001544376 
Protein GI159898129 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTTTCC GTAAACCAGG TAAGCGTTCG ATTGGCTGGC TATTGCTCTT GGTATTGATG 
ATACCGTTGA TAGCTGCCTG TGGCGAAACC GCCGCTCCAA CCGCAACTGT TGGCTCAACT
CCCGCTACTG GTGGCGAACC CACTGCTGCC CCAACTACCG CTCCTACAGA TGACACCGCT
GCAGCCACAC CAACCGACGC CGCTGCTGCA ACCGAAGAAC CAACCATGGG CGATGCTGAT
AAGTATCTGG TATTTGGTGG TTCGGGCGAA CCCGATTCAC TCGATTCGAT GGATACGACC
ACGGGTACTG CCTTGATTGT AACCCGCCAA ATCCAAGAAT CGTTGTTGGG TTTCAAGGCT
GGTACGTTGG AAGTTGTGCC CGAATTGGCG ACCAAGTGGG AGCCAAACGC CGATGCAACC
GAGTGGACGT TTACCCTGCG CGAAGGGGTT AAATTCTCTG ATGGCACCGA CTTCAACGCT
GATGCAGTAG TTTTCAACTT CCAGCGCTTG TTTGTGCCTG ATTTTGAGTT TGGCTTCCGT
GCCGAAGGCA AGCAATACAA CATCGTGCCC GATATTTTTG GTGGCTATGC TGGCGACCCC
AACAGTGCCT TCAAAGAAAT TATGGCAGTT GATCCAACCA CGGTTAAGTT TGTTTTGACG
CGGCCTGTGC CGTTGTTGCC AAGCTATTTG GCCGCTTCCT ACTTCGGAAT TTCATCGCCC
GAAGCAGTCA AGGCTGCCAA AGAAAAATAT GGCTCACCAG AAGTTGGTGG CGTTGGGACT
GGCCCCTTCA AGTTTGAGCG CTGGGATGCT GGTCAAAGCA TCACCTTGGT GCGCAACGAA
GATTATTGGG GCGACAAGGC CAAAATGCCA GGTGTGGTTG TGCGCTTTAT CGCCGAAGCA
CCCCAACGTT TGGCCGAGCT TGAAGCTGGC ACAATTGATT TCACAATCAA CTTGAGCGCT
GATAGCCGCG ATAAAATTGC TTCAAGCGCC GATTTGCAAG TGGTTGATTT GACTCCATTC
AACATTGCCT ACTTGTCGTT GAACATGAAC AACAAGCCTT TTGACGATGT GCGGGTTCGT
CAGGCGGTTG CCTATGCCAT CAACAAGCAA GAAATTCTTG ATGCCTCGTA TGGCGGCGTT
GGCTCAATTG CCGACGACTT CTTGCCCGAT GGATTGGCTG AATATCGGGC GACTGACCTC
GAACCATATG CTTATGATCC AGAAAAAGCC AAAGCCTTGT TGGCCGAAGC TGGCTATGCC
GATGGCTTTA GCACCATGGT CTTGACCGAT GGAACTGAAT TGCCCTTGGA ATTGTGGTAT
ATGCCGGTTT CACGGCCTTA CTACCCCGAT GCTAAGTCAG TGGCTGAACT CTACGCCGCC
CAACTTTCCG ACGTTGGGAT CAAGGTTGAA CTCAAGACCG AAGATTGGGG CGTGTATCTC
GATAACTGGG ATGCTGGCCT GAAAAACGGG ATGGTGATGT TGGGTTGGAC GGGCGACTAT
GGCGACCCTA ACAACTTCTT GTTCACTCAC TTTGGCCCAG GCAACGCCGA CGAAGCTGGT
TATACTAACG AGAAAGTTTG GCAATTGCTG GCCGATGCTG GTGGCGCTTC CTCGCCCGCC
GAGTCAATTC GGCTGTTCCA AGAAGCTGGC AAGTTGATTA ACACCGATTT GCCACGGATT
CCGATCGTGC ATGCTCCACC CGTATTGGCT GCTAAAAAAG CCTTGCAAGG CTGGGTGCCA
AATCCAACTG GTGGCGAATC ATTCGCACCG ATCTCAATCA CGAAATAA
 
Protein sequence
MSFRKPGKRS IGWLLLLVLM IPLIAACGET AAPTATVGST PATGGEPTAA PTTAPTDDTA 
AATPTDAAAA TEEPTMGDAD KYLVFGGSGE PDSLDSMDTT TGTALIVTRQ IQESLLGFKA
GTLEVVPELA TKWEPNADAT EWTFTLREGV KFSDGTDFNA DAVVFNFQRL FVPDFEFGFR
AEGKQYNIVP DIFGGYAGDP NSAFKEIMAV DPTTVKFVLT RPVPLLPSYL AASYFGISSP
EAVKAAKEKY GSPEVGGVGT GPFKFERWDA GQSITLVRNE DYWGDKAKMP GVVVRFIAEA
PQRLAELEAG TIDFTINLSA DSRDKIASSA DLQVVDLTPF NIAYLSLNMN NKPFDDVRVR
QAVAYAINKQ EILDASYGGV GSIADDFLPD GLAEYRATDL EPYAYDPEKA KALLAEAGYA
DGFSTMVLTD GTELPLELWY MPVSRPYYPD AKSVAELYAA QLSDVGIKVE LKTEDWGVYL
DNWDAGLKNG MVMLGWTGDY GDPNNFLFTH FGPGNADEAG YTNEKVWQLL ADAGGASSPA
ESIRLFQEAG KLINTDLPRI PIVHAPPVLA AKKALQGWVP NPTGGESFAP ISITK