Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2464 |
Symbol | |
ID | 8137805 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2879608 |
End bp | 2880741 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644870074 |
Product | putative cellulose biosynthesis (CelD) protein |
Protein accession | YP_003022265 |
Protein GI | 253701076 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 158 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCAGT TCGAGACCGA AATCGTACAA GACCCGAAAC GGTTGGCCGC GCTAAAGCCT GAGTGGGATG CGCTTTTCGC GACGATGGAT GCTGACCGCA GCGTTTTCAT GAGTCACCTG TGGTACGACT GCTGGTGGAG GCACTTTGGC AAGAAGGCGG AGCTTTTCCT CCTGGTGCTG CGCAGGCGGG GGGAGACGGT GGGTATCGCC CCCCTGATGC GACGTCGCCT CACTCTGCAC GGGCTCCCGG TGCGGGGTTT GGTCTTCGTG GCAAACGGCA ATTCGCTGCA CAACGACCTT CTGGTGCGTG GAGAGGATCG GGACGAGGTA CTGGAGGAAC TGGCCCGCTT CCTGTGCGAG GAGAGGGTGG GATGGCAGTA TATCGAATTG CATCATTTTC CTGCCGTCTC CCCCAACTGC GCCGGGTTCG CGGCGGCTTT GGCCAGGCGG AACGTTCCCG TCAGGCTTTT TTCCTCCTAT GACTCCCCAT TCCTGGAGGT AAGGGGCGGG TGGCAGCAGT TCATCTCCTC CCGTTCCCAA AGGGTGCGCA AGACCTTGCG CAACATCGCG AACACCATGG AGCGAAGCGG CGTCGTCGAG GTGAGCGAGG TCACCGACTG GGACGGATAT ATGTCGGTGC GCGAGGATGT GTTGCGGATC GCCAGGAATA GCTGGACCCA TCGGGTGGGG GACTCCCTGG CCCATCCGTT GAACGGTCCC TTCTTCGAGG AGCTGGCATA TGGCGCCGCA AAAGCCGGAT GGCTCTCGCT TTGGCTGTTG CGGCTGGACG GCAAAGCTGT CGCGTTCGAG TACCACCTGC GGGGCTGCGG GAAGGAGCAC GCGTTGCGCG GCTCCTATGA CGAGGAGTTC CACCGGCTTT CTCCCGGCGC TTTCCTGGAG ACCGAGATAC TTAAGCGCAT CTTCAGCGAG CCGCACGTGG TGGAACGCTA CGACTTCGGC GGGAGCTTCG ACGACTACAA GAAGCGCTGG AGCGATAGCT CTTTGGACCA TGCAACGATC TGCGCCTTCA ACAAGGGGGC CTACTGCCGG TTGGCGGCCT TCCACGAACT GGCCATCGTC GACACGGCGC GTCGACTGCG CAACCTCTGG AGAAAGAGAA ATGTCAACGC TTAG
|
Protein sequence | MSQFETEIVQ DPKRLAALKP EWDALFATMD ADRSVFMSHL WYDCWWRHFG KKAELFLLVL RRRGETVGIA PLMRRRLTLH GLPVRGLVFV ANGNSLHNDL LVRGEDRDEV LEELARFLCE ERVGWQYIEL HHFPAVSPNC AGFAAALARR NVPVRLFSSY DSPFLEVRGG WQQFISSRSQ RVRKTLRNIA NTMERSGVVE VSEVTDWDGY MSVREDVLRI ARNSWTHRVG DSLAHPLNGP FFEELAYGAA KAGWLSLWLL RLDGKAVAFE YHLRGCGKEH ALRGSYDEEF HRLSPGAFLE TEILKRIFSE PHVVERYDFG GSFDDYKKRW SDSSLDHATI CAFNKGAYCR LAAFHELAIV DTARRLRNLW RKRNVNA
|
| |