Gene Information |
Locus tag | Plut_0993 |
Symbol | |
ID | 3746035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium luteolum DSM 273 |
Kingdom | Bacteria |
Replicon accession | NC_007512 |
Strand | + |
Start bp | 1120928 |
End bp | 1122058 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637769028 |
Product | cellulase |
Protein accession | YP_374898 |
Protein GI | 78186855 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
|
|
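The length fields in this record are consistent with the coordinates. With 1-based inclusive positions, the gene length is End bp minus Start bp plus one, and for an unspliced CDS that ends in a stop codon the protein length is one less than the codon count. The following is a minimal Python sketch of that arithmetic. It assumes inclusive coordinates and a terminal stop codon, and the function names are hypothetical, not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
print(gene_length(1120928, 1122058))  # 1131
print(protein_length(1131))           # 376
```

Both results match the Gene Length (1131 bp) and Protein Length (376 aa) fields.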
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.710198 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.76733 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCATTG TTGGGTTGCT GGCCCTTGCC ACAGTCTGGC TGGTGTCATG CTGGCCGGCT GAAACCGATG ATGCCATCGT TCTCCGCAAA TCATGGGCCG GATACCTGCA CACGTTTGTC CAGGATGGCA GGGTGGTAAG GCCCCGGAAC GGCTTCGACA CCGTCTCTGA AGGTCAGGCC TACGCCATGA TCCGGGCGGT GACGGCGTCC GACCGGACCT CGTTCGATGC CATTCTCCTC TGGACTGAGA AAAACCTCTC CCGAAGAGAA CAGAGCGGTG ACAACCTTCT GGCCTGGCAT TATGCCGATG GTAGGGTCGT TGACTGGCAA GCGGCATCTG ACGCAGACAT CGATTATGCA TACAGCCTCC TGCTTGCCGG AAGGAAATGG GGGGATCCAT CCTATGCCAG GCTCGCACGA AAGGTGCTCG CCGATATTCT CCGCCTGGAA ACCATCTCCT ATGAAGGCAG GCTCCGCCTG CTTCCCTGGA ACCGGAAACC CACGGACGGG AATGGTTATG TGGTCCAGAA CCCATCCTAT TATTCACCAG CCCAGTTCAA GCTGTTCTTT GCCGAGACAG GTGACGGGCG ATGGCTTGAG CTTGCCGCAA CCGGCTATGA CCTGCTTGAC CGGCTGCAGG AGCCGGCCGG CGGCGGTGCC TTCCTCGTGC CGGACTGGTG CCGAATAGGC GAAGGGGGTG ATTTGAGGGA GCTTGAAGGC TATTCATCGC TCTACGGCTG GGATGCGTTA CGGGTCCCGA TGCGCATTGC ACTTGATCAT GCACTTTTTC ATGAGCCCAG AGCCGCCCGG GTGCTTGGGC GCTTTGCTGA ATTCTATACC AGCGAATTCA AGCGGTTCGG TCATGTGCAT TCCGTCTATT CAACCGGGGG CAGATCAGTG GTGTACGATG AAAACCCGCT TTCATATGCC GCTGCATACG CAGCCCTTGA GGCGTCAGGA TCGCCTCTGG CGGAAACAGC GTACCGCCGT CTGCAGCGGT TCAGCCACCT GAAGAAAGGC CACATCTACT ATCTGGACCG TAAGGATTAC TATGCCAACA GCCTTTCCTG GCTTCCCTCC TATTACCGTC TTCTCCTGAA CTCTCGAAAA TCTTCCCTCC AGGACAGGTA G
|
Protein sequence | MLIVGLLALA TVWLVSCWPA ETDDAIVLRK SWAGYLHTFV QDGRVVRPRN GFDTVSEGQA YAMIRAVTAS DRTSFDAILL WTEKNLSRRE QSGDNLLAWH YADGRVVDWQ AASDADIDYA YSLLLAGRKW GDPSYARLAR KVLADILRLE TISYEGRLRL LPWNRKPTDG NGYVVQNPSY YSPAQFKLFF AETGDGRWLE LAATGYDLLD RLQEPAGGGA FLVPDWCRIG EGGDLRELEG YSSLYGWDAL RVPMRIALDH ALFHEPRAAR VLGRFAEFYT SEFKRFGHVH SVYSTGGRSV VYDENPLSYA AAYAALEASG SPLAETAYRR LQRFSHLKKG HIYYLDRKDY YANSLSWLPS YYRLLLNSRK SSLQDR
|
| |
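The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before any computation. The sketch below shows one way to do that, then compute GC content and translate the gene sequence. It is an illustration under stated assumptions: the helper names are hypothetical, the codon table holds the standard amino-acid assignments (which translation table 11 shares), and alternative start codons are not handled (this gene starts with ATG).

```python
BASES = "TCAG"
# Amino acids for codons in the order TTT, TTC, TTA, TTG, TCT, ..., GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Any trailing partial codon is ignored.
    """
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First two blocks of the gene sequence above.
print(translate("ATGCTCATTG TTGGGTTGCT"))  # MLIVGL
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field once its spaces are removed, and `gc_fraction` on the same input can be compared against the GC content field.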