Gene Haur_0301 details


Gene Information

Locus tag: Haur_0301
Symbol:
ID: 5732196
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 359280
End bp: 361604
Gene Length: 2325 bp
Protein Length: 774 aa
Translation table: 11
GC content: 53%
IMG OID: 641277425
Product: hypothetical protein
Protein accession: YP_001543081
Protein GI: 159896834
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4354] Predicted bile acid beta-glucosidase
TIGRFAM ID:
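
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the listed gene length is consistent with it) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 359280
end_bp = 361604
gene_length_bp = 2325
protein_length_aa = 774


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(length_bp):
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_bp = span_length(start_bp, end_bp)
computed_aa = coding_length_aa(computed_bp)
print(computed_bp, computed_aa)  # 2325 774
```

Both computed values agree with the Gene Length and Protein Length fields in the record.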


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.158573
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGAAC AACGCCACCA TTCCTCAATT CCTCTCGCTG CGTGGCAACG CCCACTTGGC 
CTCGATTATC CGAATGCGGC GATTGCCGCC CACGATCATG GGCCGATTAT TGACGATGGA
GTCTTTCATG GTTTGCCAAT TGGTGGCATG GGTAGCGGGG CGATTGGCCG CAACTTTCGC
GGCGATTGGT CACGCTGGCA TTTAGAGGTT GGCAAGCACG TACATCGCAG CGTTTGGCCG
AATCAATGGA GTGTTTTCTG GCAAACCGCC AGTCAACAAG CCGCCCAAGT TTTATGCACG
ACCCAACCAG ATACCGATGA ATTGAGTAGC TGGAACTGGA ATTATCCAGT TGGCGCTGGC
AATTATCATG CACTTTTCCC CCGCGCTTGG TTCGATTATC AACACCCCGA TTGGCCACTG
GAATTGGTTC AAGAGCAATT TTCGCCTGTT TTGGCGGGTA ATCTCAAGGA AAGCAGCTTT
CCCGTTGGCG TGTTTACCTG GCGCGTAACC AATCGTGGCA GCGAAACCGT GCGTTTGGGA
TTGATGCTGA CATGGGAACA TACCCGTGCG GTCGAAGCTG CTGGCTTAAG CTTGCAACGC
CAACACAGCG CTTGGAACGA CGGTAACACC AGCGGCGTAA CCTTGACCCA AACCAGCGAT
CAAGCCTTGA GCAGCCATAA TGGCACTTGG GCTTTGGCGG TGCAAGCCCC CGAATCGGCC
AGCGTCAGTC AATGGACATG CTGGGATGTT GCGCAAGATG CTGCCGCGCT CTGGCAAGAT
TTTGCCAGTG ACGGCCAATT AGCCGACTAT CCAACCAGCC AAAAAGTTGC TGCCGATCAA
CGCAGCGCAA CGGCAATTGC CGTCACCCTC GAATTAGCGC CTGGTGCTAG CGCCGTGATT
CCCTTCAGCC TCGCTTGGGA TTTTCCAATT GTCGAGTTTG CTGATCAAAG CCGCTGGTAC
AAGCGCTATA CCCGTTTTTG GGGCACGAAC GGCGATCAAG CTCAAGCCTT GGCGGTTGCT
AGTTTGACCA ATGCTGATGC TTGGCGTACA GCAATTGAAG CATGGCAAAA CCCCATTCTT
GCTGATGATC AACGGCCATT TTGGTATAAA TCTGCTCTTT TTAACGAACT CTATTATTTG
GTTGATGGTG GTACGTTGTG GGTTGATCGG GCGGTTGGCG GGCCGGAGCC TGCGGCTGAT
GATGTGGGCT TGTTCAGCTA CCTCGAATGC TACGACTACC CGTTCTATGG CACGCTCGAT
GTGTCGTTCT ACAGCTCGTG GAGCATCTTG GCCTTGTGGC CAGAGCTTGA ACGCGGCGAG
ATTTTGGCCT TCTCCAAAAC GGTTAACGAT GCTGATGATA CCGTTGTCAC AATTGTGGCA
ACTCAAGTCC CAGCAATTCG CAAGGCCGCT GGCGCATTGC CGCATGATCT TGGTGCGCCC
AAAGAGCAAC CATTAATCAA AACCAATGCC TACGACTTCC AAGATATCAA TAACTGGAAA
GATCTCAACC TCAAGTATAT TTTGCGAATC TATCGTGATG TGAGCTTGTG GAACGATCAG
GCCATGCTGG AAGCAACTTG GGACACAATT CCAACTGCTT TAGAATATGT GCATCAATTC
GATAGTGATG GCGATGGCTT GCTCGACCAT AGTGGAGCCG ACCAAACCTA CGATACCTGG
GCCATGAGCG GCGCGGCCAG CTATTCGGCA AGCTTGCTGA TTTGTGCCTT GGAAGCTGCG
ATTCGCCTAG CCCAACGCAT GGGCGACCAT GCCCAAGCCG ATGCTTGGAG TGAATGGCTG
GCCGCGGCTC GCCAAAGTTT TGAAACTAAG CTTTGGAATG GTACTTACTT CCGCTATCAC
ACCGCTGATA CTGATTTGCG CGAAGTGATT ATGGCCGATC AATTGGTGGG CCAATGGTAT
GCAGGCGCAA TTGGCTTGCC AGCGGTTGCT CCACGCGAGA TGATTCGCTC GGCCTTGCAA
ACGGTCTATC GCTTCAACGT CATGCAATAT GCCAACGGCG CATTGGGTGC AGTCAACGGC
ATGCATCCAG ATGGCACGGT TGATACCAGC TCCAACCAAG CCAGCGAAGT TTGGAGCGGC
ACGACCTATG CGATTGCGGC CATGATGCTG CAAGAAGGGC TTGATCTTGA AGGCTGGCAA
ACCGCTTGGG GAGCCTATAA CGCCACCTAT AATGAACTTG GATTGTGGTT TCGCACACCG
GAAGCATGGG GCATCGAACG AACTTTCCGT GCCAGCATGT ACATGCGACC ACAATCAATC
TGGGCGATTG AGCATGCCTT AGCGGTGCGT GCTAAAAACG CCTGA
 
Protein sequence
MTEQRHHSSI PLAAWQRPLG LDYPNAAIAA HDHGPIIDDG VFHGLPIGGM GSGAIGRNFR 
GDWSRWHLEV GKHVHRSVWP NQWSVFWQTA SQQAAQVLCT TQPDTDELSS WNWNYPVGAG
NYHALFPRAW FDYQHPDWPL ELVQEQFSPV LAGNLKESSF PVGVFTWRVT NRGSETVRLG
LMLTWEHTRA VEAAGLSLQR QHSAWNDGNT SGVTLTQTSD QALSSHNGTW ALAVQAPESA
SVSQWTCWDV AQDAAALWQD FASDGQLADY PTSQKVAADQ RSATAIAVTL ELAPGASAVI
PFSLAWDFPI VEFADQSRWY KRYTRFWGTN GDQAQALAVA SLTNADAWRT AIEAWQNPIL
ADDQRPFWYK SALFNELYYL VDGGTLWVDR AVGGPEPAAD DVGLFSYLEC YDYPFYGTLD
VSFYSSWSIL ALWPELERGE ILAFSKTVND ADDTVVTIVA TQVPAIRKAA GALPHDLGAP
KEQPLIKTNA YDFQDINNWK DLNLKYILRI YRDVSLWNDQ AMLEATWDTI PTALEYVHQF
DSDGDGLLDH SGADQTYDTW AMSGAASYSA SLLICALEAA IRLAQRMGDH AQADAWSEWL
AAARQSFETK LWNGTYFRYH TADTDLREVI MADQLVGQWY AGAIGLPAVA PREMIRSALQ
TVYRFNVMQY ANGALGAVNG MHPDGTVDTS SNQASEVWSG TTYAIAAMML QEGLDLEGWQ
TAWGAYNATY NELGLWFRTP EAWGIERTFR ASMYMRPQSI WAIEHALAVR AKNA
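
The protein sequence can be checked against the gene sequence by translating codon by codon. The following is a minimal sketch rather than the database's own method: it builds the codon table by hand and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The `translate` helper and the variable names are illustrative. Only the first 30 bases and the last 15 bases of the gene sequence are used here, copied from the record with their spacing intact.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the record) is removed first.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_block = "ATGACCGAAC AACGCCACCA TTCCTCAATT"  # first 30 bases of the gene
last_block = "GCTAAAAACG CCTGA"                   # last 15 bases, in frame

print(translate(first_block))  # MTEQRHHSSI
print(translate(last_block))   # AKNA
```

The two results match the first ten and last four residues of the protein sequence listed above. The same function can be applied to the full gene sequence.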