Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0174 |
Symbol | |
ID | 6373828 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 169183 |
End bp | 170934 |
Gene Length | 1752 bp |
Protein Length | 583 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642682693 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_001958630 |
Protein GI | 189499160 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.655452 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGATTC GATATCTTAT GGCTGCAGTC TTTCTGCTGC TCTATACGTT TTCTTCTCCT CCCTCCAGAC CCGCTCTTGC AGAGGCCTTT CCGGCTTACA AGAACGCAAC AGCACAGGAA ATATTCAGAG AAAAAGACAA GTGGGTTGAA AAGCAGCTCA GCGAGATGAC GCTTTCTGAT AAAATCGGTC AGATGCTAAT CGCTCACAGC CCGGCAAAAT TCCGAAGTAC TGACGACAGT TACTACAAGA AACTTTCCCT TCTGGTAAGC CAGGGTAAAG TCGGCGGGAT CATGTTTCTC AAAGGCAATA CCAACGATGC CGCTGTTCTT GCCAACAGGT TTCAGTTTAT TGCTCCAAGA CCGCTGCTTA TCAGTGCGGA TATGGAAAAA GGACTTGCCA TGAGACTTGA CGGCGCCACA GAGTTTGCTC CAAGCATGGC CCTTTCGGCA ACAGGCAGAC CGGATCTTGT CTTCAAAATG GCTGGCGTGA TCGCTCAGGA AGCCAAAGCA CTGGGCATCT ACCACAGTTA CGGGCCCAGT GTCGATCTGA ACACTAACCC GCTCAATCCG GTGATCAATA CCAGGTCATA CGGTGATAAC ATCCCCTTGA CCATAGAGAT GTCGAATGCG TTTATTGACG GACTGCAATC GAACGGTATC ATCGCCACAG CAAAGCATTT TCCCGGACAC GGGGACGTCA CGGTCGACAG TCATATCAAT CTTCCTGTTC TCAACGCGGA TAAAAAACGT CTGGAACGCG TTGAACTGAA ACCATTCATA GCAGCTATAG ACCACGGAAT AATGAGCATC ATGATCGGCC ACCTCGCGAT CCCGGCTTAT ACAGGCAGCA TGACACCGGC GACGCTCTCA TGGAGAATTG TCACAAAACT CCTGAGAAAG GAACTGGGTT TCGATGGCCT CATCATTACC GACGCGCTGA ACATGAAGGC GCTCTATCAG TCCTACACTC TTGAAGATAT TTCTTTACGT GCCGTTGAAG CAGGCAACGA CCTGCTTCTT TTCTCACCTG ACCCGGAACG TACCCACACC ACCCTGCTCA ACGCTGTGAG AAGAGGCAAA CTCTCGGAAA AACAGATCAA CAAGTCCGTA CGCCGGATAC TTCTGGCCAA AAGGTGGCTT GGCCTTGATA AAAATCGCCT GGTCAACCTG AACAGTATCC ACGGCCAGAT GAACCTGAAA AGCCATCGGG AACTTGCAGA GAATATCGCC GACAACGCTA TAACCGTCAT AAGAGACAAG CATCAGGCCC TTCCTGTCAG GCAAGAAAAC AAAAACAACA TCCTGCATAT CGTTCTCGAA AACAAGCGCT ACTCACTGTC GGGAGAATCT TTTTCAGACA AGCTGTACAG GGCATTCCAG GCTAAAACCA TACGTCTGGA CCATAACTCC AGCGCCCGTG ACTATCTCGA CGCTGCTGAT AAAGCCAAAC GCGCGTCGAC CATTATCGTC TCAACCTATG TTGAAGTGCT TTCCGGCACA AAGTCACTGG CTGTAAGTAA AGGGCAGGAG GAATTCATCA GCACACTTGT TCGCGATCTG CCGTCAAAGC GTTCATGTAT TATGATTTCA TTCGGAACGC CCTACCTGAT CAACCAGTTT CCCGACATAC CTGCTTTCAT CTGTACCTAC TCATCTTCTG AGCTCAGTGA AGATTCCGCC GTCAGGCTGC TGCAGGGAAA AATCAAGCCG ACAGGAAAAC TCCCCATATC CCTTACGGAA AACCGGCGGT AA
|
Protein sequence | MLIRYLMAAV FLLLYTFSSP PSRPALAEAF PAYKNATAQE IFREKDKWVE KQLSEMTLSD KIGQMLIAHS PAKFRSTDDS YYKKLSLLVS QGKVGGIMFL KGNTNDAAVL ANRFQFIAPR PLLISADMEK GLAMRLDGAT EFAPSMALSA TGRPDLVFKM AGVIAQEAKA LGIYHSYGPS VDLNTNPLNP VINTRSYGDN IPLTIEMSNA FIDGLQSNGI IATAKHFPGH GDVTVDSHIN LPVLNADKKR LERVELKPFI AAIDHGIMSI MIGHLAIPAY TGSMTPATLS WRIVTKLLRK ELGFDGLIIT DALNMKALYQ SYTLEDISLR AVEAGNDLLL FSPDPERTHT TLLNAVRRGK LSEKQINKSV RRILLAKRWL GLDKNRLVNL NSIHGQMNLK SHRELAENIA DNAITVIRDK HQALPVRQEN KNNILHIVLE NKRYSLSGES FSDKLYRAFQ AKTIRLDHNS SARDYLDAAD KAKRASTIIV STYVEVLSGT KSLAVSKGQE EFISTLVRDL PSKRSCIMIS FGTPYLINQF PDIPAFICTY SSSELSEDSA VRLLQGKIKP TGKLPISLTE NRR
|
| |