Gene Haur_4683 details

Gene Information

Locus tag: Haur_4683
Symbol:
ID: 5736530
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5981095
End bp: 5982789
Gene Length: 1695 bp
Protein Length: 564 aa
Translation table: 11
GC content: 53%
IMG OID: 641281847
Product: extracellular solute-binding protein
Protein accession: YP_001547442
Protein GI: 159901195
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
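
The length fields above follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative, not part of the record):

```python
# Coordinates copied from the record above.
start_bp = 5981095
end_bp = 5982789

# Inclusive 1-based coordinates: length = end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is assumed to be a stop
# codon, which is not counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1695 bp
print(protein_length, "aa")  # 564 aa
```

Both values agree with the Gene Length and Protein Length fields.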


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTCGAC GTATTCGTTG GCAAATTGGA GCCACGTTGG TTTGTGCTAG TCTGGTTTTG 
TTGTTGTTGG CGCATCTTGC GCTAGCAACA ACAGCAGGCG GCGAGGCAGG CGTTGGCGGG
GTCTATCGCG AGGCCTTGGT TGGCGAGGTC AATGATCTTA ACCCATTGCG CGGTAGTGCC
CAAACCCGCG CCGAGGCCGA CTTAAGTGCT TTGTTGTTTG AAGGCTTGAC CCGCGTTGAG
GCAACTGGCG TGGTCGTGCC CGATCTCGCC GAAAATTGGC AGGTCAGCAG CAATAATCTG
GTTTATACCA TGACCTTGCG CACTGATGTG CGCTGGCACG ATGCCACACC TTTTTCTGCA
AACGATGTGA TCTGGACGAT CGACTGGATC AAAAGCCCAC AATTTAATGG CGACCCCAGC
CTTAAGTTAG CTTGGCAAAA TATCGCCGTC ACCGCGTTGA ATCCGCAAAC TGTGCGTTTT
AACTTGCCAG TTGCTTTTGC GCCCTTCCTC AACCAACTAA CTGTGCCAAT TTTGCCCGCG
CATCTGTTGC GCAACGCTAC GCCCGACCAA TGGCAAGCTT GGAATCAGCA ACCAGTTGGC
ACTGGCAGTT TTCGCTTTGG TAAACGCGAT GGCAACTTAA TCGAGCTATT GGCCAACACT
GATTACCGTC CGAATATTCC CAATAGTCGC CCCAATTTGG AGCGGCTGGA GTTTCGGCTT
TACGCAACCA TGGAAGCTGC GCATGAAGCA CTGCGCCGTG ATTTGGTCGA TGCGTTTGCC
TACGATGTGA CTGAACAAGG CGAATTACCG TTGCCAATCG GCTATCGCCA GGTGCGTGCG
CCTTTGGCTG ATTATGTGGT CCTGACGATT AATTTGCGTC GTCCGCCACT TGATGATTTA
CGGGTGCGCC AAGCGATCGA TTATGCAGTT GACCCAACTA CGTTGATTAC TGAAGTGTTG
CAAAATCGGG CGTTGCCCTT GAATTCGCCA ATTTTGCCTT CGTCGTGGGC GGTTGCTGAG
GGTCTCACCG GCTATATTGA TGATGCTGAG CAGAGTCGCG CCAATGGGTT GCTTGATCAA
GCTGGTTGGC AGCGTGATGC CGAAGGGTGG CGGCGTAAAG CTGGCCAATT GTTGCGCTTC
GATTTGGTCA GTGCCAACAG CCCCGAACGC AGCGCCATCG CCGAACGCAT TCAAGCCCAA
TTAGCCACGG TCGGTATTTC AAGCAGTTTG CAATTGGTTC CGAGCAGCAA TTTGCAAGAT
ACGCTGAGCC AACATACCTT CGATTTAGCA ATTCATGGTT GGTCGAATTT AGGCAGCGAC
CCCGATGTTT ACGAGTTATG GCATTCGTCG CAGGTCAACG CCGCCAACTT TGCTGGCCTC
GAAGACCCAG CAATTGATGA TTTATTGGTG CGTGCTCGCC AAACCACCAG CCAAGAAACC
CGCCGCGAAT TGTATGCTGA ATTTCAGGCC AAATGGCTAG CACTCGCGCC CTCAGTGATG
CTCTATCAGC CGTTGTTGGA GCAGCAGGTT GGACCATCGA TCGAGGTGCT TGGTCTGGCT
CCAATCAACC AACAAGCTGA AGTGCTCTAT CGGCATAGCG ATCGCTTCCG GCGGATCGCC
AATTGGTATC TAGTATCCAC CCGTCAAGTC TTGCCCAACT TCCGAAATGA ACCACAAAAC
GAACGCCCAC GCTAA
 
Protein sequence
MVRRIRWQIG ATLVCASLVL LLLAHLALAT TAGGEAGVGG VYREALVGEV NDLNPLRGSA 
QTRAEADLSA LLFEGLTRVE ATGVVVPDLA ENWQVSSNNL VYTMTLRTDV RWHDATPFSA
NDVIWTIDWI KSPQFNGDPS LKLAWQNIAV TALNPQTVRF NLPVAFAPFL NQLTVPILPA
HLLRNATPDQ WQAWNQQPVG TGSFRFGKRD GNLIELLANT DYRPNIPNSR PNLERLEFRL
YATMEAAHEA LRRDLVDAFA YDVTEQGELP LPIGYRQVRA PLADYVVLTI NLRRPPLDDL
RVRQAIDYAV DPTTLITEVL QNRALPLNSP ILPSSWAVAE GLTGYIDDAE QSRANGLLDQ
AGWQRDAEGW RRKAGQLLRF DLVSANSPER SAIAERIQAQ LATVGISSSL QLVPSSNLQD
TLSQHTFDLA IHGWSNLGSD PDVYELWHSS QVNAANFAGL EDPAIDDLLV RARQTTSQET
RRELYAEFQA KWLALAPSVM LYQPLLEQQV GPSIEVLGLA PINQQAEVLY RHSDRFRRIA
NWYLVSTRQV LPNFRNEPQN ERPR
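
The sequences above are printed in blocks of 10 residues, so they need to be joined before use. The following is a minimal sketch of joining the blocks and translating the gene sequence. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs only in which codons may act as start codons). The helper names `join_blocks` and `translate` are hypothetical, not part of any library.

```python
from itertools import product

# Amino acid assignments for codons in TCAG order (standard code; table 11
# uses the same assignments). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(blocked: str) -> str:
    """Remove the spaces and line breaks that split the sequence into blocks."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence from the record above.
first_line = "ATGGTTCGAC GTATTCGTTG GCAAATTGGA GCCACGTTGG TTTGTGCTAG TCTGGTTTTG"
last_line = "GAACGCCCAC GCTAA"

print(translate(join_blocks(first_line)))  # MVRRIRWQIGATLVCASLVL
print(translate(join_blocks(last_line)))   # ERPR
```

The two outputs match the first 20 and last 4 residues of the protein sequence listed above.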