Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_0339 |
Symbol | |
ID | 3748059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 381437 |
End bp | 382633 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637772866 |
Product | endoglucanase Y-like |
Protein accession | YP_378655 |
Protein GI | 78188317 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0387741 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGCGTT TTTGGACTTT TTTGTTGATT AGTTGTGCTC CTTTCTTTTT ATCGAGTTGC TCATCTGTTC CACAAAATCC AGAAAAGATT TCTACGGAAG CATGGCACTA CTATCGCGAT ACGTTTATAA AAAATGGTCG CGTTATTCGT CCTCAAAACA ATAATGATAC TGTTTCGGAG GGGCAAGCTT ACACAATGGT TCGTGCGGTT TTAATGAAGG ATCGCAAAAC CTTTGATGAA TGCCTTGCTT GGAGCGAAAA GGTGCTCTCC CGTAAAAATA GTGATGGTGA TTATTTGCTT GCATGGCATT ACCGTGATGG CAAAGTTACC GATACAACGG CAGCCTCTGA TGCCGATATT GATTATGCTT TTAGCTTAAT TGTTGCCTCA AAAATATGGC AGGCACCTCG TTACCTTGAG CTTGCTAAAG AGGTGCTTGC AAGCATTCTG GAAGCGGAAA CCACGCGCCA TCAAGGGCGT TTATATTTAT TGCCGTGGCC TGCAAACAAA AACAAGCCAG GTGATTTATT AGCACAAAAC CTCTCTTACT ACGCACCTTC CCATTTCAAG CTTTTTTACG AAACCACAAG CGACCCTCGT TGGCTTGAGC TTGTGGATAC AACCTACTAC TTATTGGGAC GCTTGCTGCA CCCTGGTGAA TTACCAGAGG GACCTATTGT GCCTGATTGG ATTGCTATAA ATGATGCTGG CGCTTTTGTG CATTTACCGG GTAAAGATGT ACGCTATGGC TGGGATGCTG TAAGAGTGCC AATGAGAATT GCGGCTGATT ACCATTTATA TGGCGATAAA CGTGCTTTTG AGGTGCTCAG TTGGTTAGCG GTATCGTTTG AGGAGGAATT TCGCCAACAA TCAAAATTTT TGTTACAGCG AGATTCAACC CTCCAAGTGC GTAATAATGC CCTTTTTTAT AGCGCTATGT ATGCTTCGTT AGAAGCAACG GAATCGCCAA GTGCCCCTAA GTTGCTACAA CGCATTCGCA AGTTTATTAG GCAAGAGAAG CAAGGTTTGT TTTATAATCA TCCTGATGAT TACTACATCA ACAGCCTATG CTGGATTACG GAGTATTACG AGCAAAACAA AAAACACTTA CAAGCACGTT CTAAAAAAGT TGCTCTTCCA TTACAAACCC ACGAAAGTAC AGCAAACACG GCATCCTTGA GTTTACAAGC ACCATAG
|
Protein sequence | MRRFWTFLLI SCAPFFLSSC SSVPQNPEKI STEAWHYYRD TFIKNGRVIR PQNNNDTVSE GQAYTMVRAV LMKDRKTFDE CLAWSEKVLS RKNSDGDYLL AWHYRDGKVT DTTAASDADI DYAFSLIVAS KIWQAPRYLE LAKEVLASIL EAETTRHQGR LYLLPWPANK NKPGDLLAQN LSYYAPSHFK LFYETTSDPR WLELVDTTYY LLGRLLHPGE LPEGPIVPDW IAINDAGAFV HLPGKDVRYG WDAVRVPMRI AADYHLYGDK RAFEVLSWLA VSFEEEFRQQ SKFLLQRDST LQVRNNALFY SAMYASLEAT ESPSAPKLLQ RIRKFIRQEK QGLFYNHPDD YYINSLCWIT EYYEQNKKHL QARSKKVALP LQTHESTANT ASLSLQAP
|
| |