Gene GM21_2464 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2464 
Symbol 
ID8137805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2879608 
End bp2880741 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content61% 
IMG OID644870074 
Productputative cellulose biosynthesis (CelD) protein 
Protein accessionYP_003022265 
Protein GI253701076 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5653] Protein involved in cellulose biosynthesis (CelD) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones158 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCAGT TCGAGACCGA AATCGTACAA GACCCGAAAC GGTTGGCCGC GCTAAAGCCT 
GAGTGGGATG CGCTTTTCGC GACGATGGAT GCTGACCGCA GCGTTTTCAT GAGTCACCTG
TGGTACGACT GCTGGTGGAG GCACTTTGGC AAGAAGGCGG AGCTTTTCCT CCTGGTGCTG
CGCAGGCGGG GGGAGACGGT GGGTATCGCC CCCCTGATGC GACGTCGCCT CACTCTGCAC
GGGCTCCCGG TGCGGGGTTT GGTCTTCGTG GCAAACGGCA ATTCGCTGCA CAACGACCTT
CTGGTGCGTG GAGAGGATCG GGACGAGGTA CTGGAGGAAC TGGCCCGCTT CCTGTGCGAG
GAGAGGGTGG GATGGCAGTA TATCGAATTG CATCATTTTC CTGCCGTCTC CCCCAACTGC
GCCGGGTTCG CGGCGGCTTT GGCCAGGCGG AACGTTCCCG TCAGGCTTTT TTCCTCCTAT
GACTCCCCAT TCCTGGAGGT AAGGGGCGGG TGGCAGCAGT TCATCTCCTC CCGTTCCCAA
AGGGTGCGCA AGACCTTGCG CAACATCGCG AACACCATGG AGCGAAGCGG CGTCGTCGAG
GTGAGCGAGG TCACCGACTG GGACGGATAT ATGTCGGTGC GCGAGGATGT GTTGCGGATC
GCCAGGAATA GCTGGACCCA TCGGGTGGGG GACTCCCTGG CCCATCCGTT GAACGGTCCC
TTCTTCGAGG AGCTGGCATA TGGCGCCGCA AAAGCCGGAT GGCTCTCGCT TTGGCTGTTG
CGGCTGGACG GCAAAGCTGT CGCGTTCGAG TACCACCTGC GGGGCTGCGG GAAGGAGCAC
GCGTTGCGCG GCTCCTATGA CGAGGAGTTC CACCGGCTTT CTCCCGGCGC TTTCCTGGAG
ACCGAGATAC TTAAGCGCAT CTTCAGCGAG CCGCACGTGG TGGAACGCTA CGACTTCGGC
GGGAGCTTCG ACGACTACAA GAAGCGCTGG AGCGATAGCT CTTTGGACCA TGCAACGATC
TGCGCCTTCA ACAAGGGGGC CTACTGCCGG TTGGCGGCCT TCCACGAACT GGCCATCGTC
GACACGGCGC GTCGACTGCG CAACCTCTGG AGAAAGAGAA ATGTCAACGC TTAG
 
Protein sequence
MSQFETEIVQ DPKRLAALKP EWDALFATMD ADRSVFMSHL WYDCWWRHFG KKAELFLLVL 
RRRGETVGIA PLMRRRLTLH GLPVRGLVFV ANGNSLHNDL LVRGEDRDEV LEELARFLCE
ERVGWQYIEL HHFPAVSPNC AGFAAALARR NVPVRLFSSY DSPFLEVRGG WQQFISSRSQ
RVRKTLRNIA NTMERSGVVE VSEVTDWDGY MSVREDVLRI ARNSWTHRVG DSLAHPLNGP
FFEELAYGAA KAGWLSLWLL RLDGKAVAFE YHLRGCGKEH ALRGSYDEEF HRLSPGAFLE
TEILKRIFSE PHVVERYDFG GSFDDYKKRW SDSSLDHATI CAFNKGAYCR LAAFHELAIV
DTARRLRNLW RKRNVNA