Gene Haur_4018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4018 
Symbol 
ID5735879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5125407 
End bp5127068 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content51% 
IMG OID641281168 
Productextracellular solute-binding protein 
Protein accessionYP_001546778 
Protein GI159900531 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.112501 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCGAA TATTGACCCT AGGATTATTG GCTGTTTTGC TCACAGCATG TAGCCTCGGC 
TCCACACCAG CACCCACCAA CGAGCCAACC ACACCGCCAA TTCAGGTTCC GCAAGGTGGT
ACGTTAACCA TTCGCACGGC CCAAGATATT GCAGCTTTGC ACCCGTGGAA ACCAACCTGC
CACGAAGAAG CTCAACTTTT AGGCCTGCTC TATCGTGGTT TAACTAAGCT TGATCAAAGC
CTCGCTCCGC AACCTGATGT TGCTACAAGC TGGCAAAGCG ATAGTATCGG CCAAACCCTG
ACCATGACCT TGCGCAGCGA TATTCGTTGG CATGATGATA CGACCTTGAC TGCTGCTGAT
GCGGCTTGGA CGATCAGCGC CATGCAAAGC ATCAGCCCAA CCACGCCATT ATTGACCGAT
CTTCAGGGCT TGGTGCGCAA GGTTACTGCA CCCGATGACA CAACTTTGGT CATTTCGTTG
CGCGAACCCT ATGCGCCATT GCTCTCGGCC TTGAGCATGC CAATTTTGCC CAAACATGTG
TTTGAGCAAT TAAGCCCAGT TGAGCTTGAT CAGCTTAATC TTTTGACGCA GCCAATTGGT
AGCGGCCCGT TTATGTTCGA GCAACGGACT GCTGGCTCAG CAATTAGCTT AATTCGCAAT
AGCAACTATA TCGATGGCGT GCCCTATCTC GATCGGGTGG CCTTTGTGGT TGCCCCCGAT
CCGCAAGTGG CTCGCCAAGC AGTGCGCGAT GGAGATTTAT TGGCAGCAGA ATTGCCATGG
GCACAAAGCC AAGGCTTAGG GCCAACAGTT GGCATAGGTA GTTATCCTGA AAATGGTTTT
TATTATTTGG CCTTTAATAT GCGTGATGGT CGCATCTTCA GCGATCAGCG AGTGCGCCAA
GCGCTGGCAT TAAGCCTCGA TCTCAATACA ATTGTCGAAA CCGCTGGCCC AGCCGCTCAA
GCAATTTTGA GCGATCATTT GCCTGGCACA TGGGTTGCGC CAACTGGTGA GTTGCCCAAA
CGCAACTTAG ATCAAGCTCG CGAATTACTG GATCAAGCAG GCTGGGTCTT GCCCGAAGGT
GCGACAATTC GCGCCTCGAA CGGGATTACG CTGTCGATGG CGCTGTTTGT GCGGGGCGAT
GACCAACGCC GCATCGAGGT TGCCGAACGG ATCGCCGCTG CTACACGTCC GGTTGGCTTC
AATATTGTGG TTACGCCAGC CGATTTCGAG AGCGTGATTC GCTCTAAATT GGTAACACCC
TTTGATTTTG ATTTGGCTTT GATGAGTTGG GGCAATAGTC GAGTTGGTGG TTCGCCCTCG
TATACCGCCT ACGATCCTGA TAATTTCTCG TTGTTTCATT CGAGCCAAAT TTACCAAGGG
GTTGCCGATG GACGGCCTGG CCTGCGCAAT TATGGTGCGT TTCAAAACAC CAGTTTCGAT
AATTTATCGA CGGCAGCGCG GGCACTTTAC GCAACTGAAC GCCGCCGCGA ACTCTATCAA
CAAACCAACA CAATCATTCA AACCGAATAT CCGTATGTGT TTCTGTGGGC CGATCGGATT
CCGGTCGCCT TAGCCAAACA GGTGCGTTCA ACTCAAGGCG AAATTCGGTT GGATACGGCC
AATTGGCTGT ATGATGTTCA ACATTGGTAT CTTGAGCAAT AA
 
Protein sequence
MMRILTLGLL AVLLTACSLG STPAPTNEPT TPPIQVPQGG TLTIRTAQDI AALHPWKPTC 
HEEAQLLGLL YRGLTKLDQS LAPQPDVATS WQSDSIGQTL TMTLRSDIRW HDDTTLTAAD
AAWTISAMQS ISPTTPLLTD LQGLVRKVTA PDDTTLVISL REPYAPLLSA LSMPILPKHV
FEQLSPVELD QLNLLTQPIG SGPFMFEQRT AGSAISLIRN SNYIDGVPYL DRVAFVVAPD
PQVARQAVRD GDLLAAELPW AQSQGLGPTV GIGSYPENGF YYLAFNMRDG RIFSDQRVRQ
ALALSLDLNT IVETAGPAAQ AILSDHLPGT WVAPTGELPK RNLDQARELL DQAGWVLPEG
ATIRASNGIT LSMALFVRGD DQRRIEVAER IAAATRPVGF NIVVTPADFE SVIRSKLVTP
FDFDLALMSW GNSRVGGSPS YTAYDPDNFS LFHSSQIYQG VADGRPGLRN YGAFQNTSFD
NLSTAARALY ATERRRELYQ QTNTIIQTEY PYVFLWADRI PVALAKQVRS TQGEIRLDTA
NWLYDVQHWY LEQ