Gene Haur_2846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2846 
Symbol 
ID5734727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3610294 
End bp3611301 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content50% 
IMG OID641279989 
Productperiplasmic solute binding protein 
Protein accessionYP_001545612 
Protein GI159899365 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGCGCT GGTGGATTAT TAGCTTCCTT TCGCTGTTTA TCTTGGCAAG TTGCGGTGCT 
GAACAAGCCG CCCCAACCAC CCAAGGCCAA ACCGCCAAAC TCAACGTCGT CAGTACGGTT
TCGCCAATTA CCAATATTAT CTACAACATT GCAGGCGATA AAATTAGCCT AACCGGAATT
GTGCCTGAGG GTGTGAATTC CCACACCTTC GAGCCAGTGC CTTCAGATGC CAAAACCTTG
GCCGAGGCCG ACTTAATCTT TATCAATGGC CTTAATTTAG AGGAACCAAC CCACAAATTG
GCCGAAGCCA ACAAACAACC AAGTGCCGAA ATTATCTTGC TGGGCGAGCA AACGATCACG
CCTGAGCAAT ATGTCTACGA TTTCTCGTTT CCTAAAGAGG CTGGTAGCCC CAACCCGCAC
CTCTGGACAC ACCCGCTGCA TGGCTTGCGC TACGCCGAAA TTGTGCGTGA TGCCTTGGTG
CGCCGCGACC CGAGCAACGC TGAGTATTAC AACGCCAATT ATGCCAGCTT CAAAACCCGC
ATCGAAGCCT TTGATCTGGC AGTTAAAAAA ACGATCGAGA GTATCCCGGC TGAAAATCGC
AAATTGCTGA CCTACCACGA TTCATGGGCT TATTTCGCCC CGCACTATGG CATGACCGTG
ATTGGGGCAA TTCAGCCTGC TGATTTTGCC GAGCCATCGG CCAAAGATGT TGCCGATTTG
ATCACGCAGA TTCGTGAGCA AAAAGTGCCA GCGATTTTTG GCTCGGAAGT TTTTCCATCG
CCAGTATTAG AGCAAATTGG CCGCGAAACA GGCGTAAAAT ATATTGATAG CCTGCGCGAC
GACGATTTAC CTGGCGAGGT CGGGGCTGCC AATCACTCAT ATTTAGGCTT GCTGACCGAA
GATCTGCGGA TTATGGCCGA AAATTTGGGT GGCGACCCCA GCTTGATTGC CAATTTCGAT
ACCAGCAATA TTCCTGGCAG CGATAGCAGC GTCGTTCAAC AACAATAG
 
Protein sequence
MRRWWIISFL SLFILASCGA EQAAPTTQGQ TAKLNVVSTV SPITNIIYNI AGDKISLTGI 
VPEGVNSHTF EPVPSDAKTL AEADLIFING LNLEEPTHKL AEANKQPSAE IILLGEQTIT
PEQYVYDFSF PKEAGSPNPH LWTHPLHGLR YAEIVRDALV RRDPSNAEYY NANYASFKTR
IEAFDLAVKK TIESIPAENR KLLTYHDSWA YFAPHYGMTV IGAIQPADFA EPSAKDVADL
ITQIREQKVP AIFGSEVFPS PVLEQIGRET GVKYIDSLRD DDLPGEVGAA NHSYLGLLTE
DLRIMAENLG GDPSLIANFD TSNIPGSDSS VVQQQ