Gene Information |
Locus tag | Haur_0301 |
Symbol | |
ID | 5732196 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 359280 |
End bp | 361604 |
Gene Length | 2325 bp |
Protein Length | 774 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277425 |
Product | hypothetical protein |
Protein accession | YP_001543081 |
Protein GI | 159896834 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4354] Predicted bile acid beta-glucosidase |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.158573 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGACCGAAC AACGCCACCA TTCCTCAATT CCTCTCGCTG CGTGGCAACG CCCACTTGGC CTCGATTATC CGAATGCGGC GATTGCCGCC CACGATCATG GGCCGATTAT TGACGATGGA GTCTTTCATG GTTTGCCAAT TGGTGGCATG GGTAGCGGGG CGATTGGCCG CAACTTTCGC GGCGATTGGT CACGCTGGCA TTTAGAGGTT GGCAAGCACG TACATCGCAG CGTTTGGCCG AATCAATGGA GTGTTTTCTG GCAAACCGCC AGTCAACAAG CCGCCCAAGT TTTATGCACG ACCCAACCAG ATACCGATGA ATTGAGTAGC TGGAACTGGA ATTATCCAGT TGGCGCTGGC AATTATCATG CACTTTTCCC CCGCGCTTGG TTCGATTATC AACACCCCGA TTGGCCACTG GAATTGGTTC AAGAGCAATT TTCGCCTGTT TTGGCGGGTA ATCTCAAGGA AAGCAGCTTT CCCGTTGGCG TGTTTACCTG GCGCGTAACC AATCGTGGCA GCGAAACCGT GCGTTTGGGA TTGATGCTGA CATGGGAACA TACCCGTGCG GTCGAAGCTG CTGGCTTAAG CTTGCAACGC CAACACAGCG CTTGGAACGA CGGTAACACC AGCGGCGTAA CCTTGACCCA AACCAGCGAT CAAGCCTTGA GCAGCCATAA TGGCACTTGG GCTTTGGCGG TGCAAGCCCC CGAATCGGCC AGCGTCAGTC AATGGACATG CTGGGATGTT GCGCAAGATG CTGCCGCGCT CTGGCAAGAT TTTGCCAGTG ACGGCCAATT AGCCGACTAT CCAACCAGCC AAAAAGTTGC TGCCGATCAA CGCAGCGCAA CGGCAATTGC CGTCACCCTC GAATTAGCGC CTGGTGCTAG CGCCGTGATT CCCTTCAGCC TCGCTTGGGA TTTTCCAATT GTCGAGTTTG CTGATCAAAG CCGCTGGTAC AAGCGCTATA CCCGTTTTTG GGGCACGAAC GGCGATCAAG CTCAAGCCTT GGCGGTTGCT AGTTTGACCA ATGCTGATGC TTGGCGTACA GCAATTGAAG CATGGCAAAA CCCCATTCTT GCTGATGATC AACGGCCATT TTGGTATAAA TCTGCTCTTT TTAACGAACT CTATTATTTG GTTGATGGTG GTACGTTGTG GGTTGATCGG GCGGTTGGCG GGCCGGAGCC TGCGGCTGAT GATGTGGGCT TGTTCAGCTA CCTCGAATGC TACGACTACC CGTTCTATGG CACGCTCGAT GTGTCGTTCT ACAGCTCGTG GAGCATCTTG GCCTTGTGGC CAGAGCTTGA ACGCGGCGAG ATTTTGGCCT TCTCCAAAAC GGTTAACGAT GCTGATGATA CCGTTGTCAC AATTGTGGCA ACTCAAGTCC CAGCAATTCG CAAGGCCGCT GGCGCATTGC CGCATGATCT TGGTGCGCCC AAAGAGCAAC CATTAATCAA AACCAATGCC TACGACTTCC AAGATATCAA TAACTGGAAA GATCTCAACC TCAAGTATAT TTTGCGAATC TATCGTGATG TGAGCTTGTG GAACGATCAG GCCATGCTGG AAGCAACTTG GGACACAATT CCAACTGCTT TAGAATATGT GCATCAATTC GATAGTGATG GCGATGGCTT GCTCGACCAT AGTGGAGCCG ACCAAACCTA CGATACCTGG GCCATGAGCG GCGCGGCCAG CTATTCGGCA AGCTTGCTGA TTTGTGCCTT GGAAGCTGCG ATTCGCCTAG CCCAACGCAT GGGCGACCAT GCCCAAGCCG ATGCTTGGAG TGAATGGCTG 
GCCGCGGCTC GCCAAAGTTT TGAAACTAAG CTTTGGAATG GTACTTACTT CCGCTATCAC ACCGCTGATA CTGATTTGCG CGAAGTGATT ATGGCCGATC AATTGGTGGG CCAATGGTAT GCAGGCGCAA TTGGCTTGCC AGCGGTTGCT CCACGCGAGA TGATTCGCTC GGCCTTGCAA ACGGTCTATC GCTTCAACGT CATGCAATAT GCCAACGGCG CATTGGGTGC AGTCAACGGC ATGCATCCAG ATGGCACGGT TGATACCAGC TCCAACCAAG CCAGCGAAGT TTGGAGCGGC ACGACCTATG CGATTGCGGC CATGATGCTG CAAGAAGGGC TTGATCTTGA AGGCTGGCAA ACCGCTTGGG GAGCCTATAA CGCCACCTAT AATGAACTTG GATTGTGGTT TCGCACACCG GAAGCATGGG GCATCGAACG AACTTTCCGT GCCAGCATGT ACATGCGACC ACAATCAATC TGGGCGATTG AGCATGCCTT AGCGGTGCGT GCTAAAAACG CCTGA
Protein sequence | MTEQRHHSSI PLAAWQRPLG LDYPNAAIAA HDHGPIIDDG VFHGLPIGGM GSGAIGRNFR GDWSRWHLEV GKHVHRSVWP NQWSVFWQTA SQQAAQVLCT TQPDTDELSS WNWNYPVGAG NYHALFPRAW FDYQHPDWPL ELVQEQFSPV LAGNLKESSF PVGVFTWRVT NRGSETVRLG LMLTWEHTRA VEAAGLSLQR QHSAWNDGNT SGVTLTQTSD QALSSHNGTW ALAVQAPESA SVSQWTCWDV AQDAAALWQD FASDGQLADY PTSQKVAADQ RSATAIAVTL ELAPGASAVI PFSLAWDFPI VEFADQSRWY KRYTRFWGTN GDQAQALAVA SLTNADAWRT AIEAWQNPIL ADDQRPFWYK SALFNELYYL VDGGTLWVDR AVGGPEPAAD DVGLFSYLEC YDYPFYGTLD VSFYSSWSIL ALWPELERGE ILAFSKTVND ADDTVVTIVA TQVPAIRKAA GALPHDLGAP KEQPLIKTNA YDFQDINNWK DLNLKYILRI YRDVSLWNDQ AMLEATWDTI PTALEYVHQF DSDGDGLLDH SGADQTYDTW AMSGAASYSA SLLICALEAA IRLAQRMGDH AQADAWSEWL AAARQSFETK LWNGTYFRYH TADTDLREVI MADQLVGQWY AGAIGLPAVA PREMIRSALQ TVYRFNVMQY ANGALGAVNG MHPDGTVDTS SNQASEVWSG TTYAIAAMML QEGLDLEGWQ TAWGAYNATY NELGLWFRTP EAWGIERTFR ASMYMRPQSI WAIEHALAVR AKNA
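The coordinate, length, and sequence fields in this record can be cross-checked against each other. The following is a minimal sketch of such a check, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in Protein Length. The function names (gene_length, protein_length, clean_sequence, gc_fraction) are hypothetical helpers introduced only for illustration.

```python
# Values copied from the record above.
START_BP = 359280
END_BP = 361604
GENE_LENGTH_BP = 2325
PROTEIN_LENGTH_AA = 774


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks used to display sequences in blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # Both checks pass for the values listed in this record.
    assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA

    # First two display blocks of the gene sequence, as an input example.
    fragment = clean_sequence("ATGACCGAAC AACGCCACCA")
    print(fragment, round(gc_fraction(fragment), 2))
```

To compare against the GC content field, paste the full Gene sequence value into clean_sequence and pass the result to gc_fraction. The record reports a rounded percentage, so an exact match to more decimal places should not be expected.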