Gene GM21_3517 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3517 
Symbol 
ID8138889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4058415 
End bp4059428 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content64% 
IMG OID644871136 
Producthypothetical protein 
Protein accessionYP_003023296 
Protein GI253702107 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.0000347699 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGCGGCCTG GCCCTGCGGC GCTCCCCAAG TTTTCCATCA TCATACCGGT GAAGCCGGGG 
GGCGAGGTGC GCGCCCTGTC CGGCCTGAAC CAGGCGACCT ACCCGGAAGA GCTGTTCGAG
GTGCTGATCG CCTACGGCCG TCAGCCCAGC GTGCAGAGAA ACGTCGCCGC CCGCGACGCG
AAGGGAGAGA TCCTTTATTT CCTGGACGAC GACTCCCTGG TCGCCCCGGG GTTTCTCGAG
CGCGCGGCGT CACACTACCG GGAGCCGAAG GTCGCGGCGG TTGGCGGGCC TTCTCTTACC
CCCGCCAACG ATTCCCCCCT GCAAAGAGCG ATAGGGACCG CCTTCACCTC CCCCGTCGGC
GGGGGAGGAG TCCGCAACCG CTACCGCAAA AACGGTAGCG CCCGGTACAG CAACGACAGC
GAACTCATCC TGTGCAATTT GAGCTTCAGG CGCGATATCT TCCTGACCCA CGAGGGGTTG
GATGAGCGGC TCTATCCCAA CGAAGAGAAC GAGCTGATGG ACCGACTGCA ACAGGAGGGT
CACCTGCTGG TGCACGACCC CGAGCTGGCC ATCGTGCGCA GCCAGCGCAA CACCTATCGC
GCCTATGTGA GGCAGATGTA CGGCTACGGC CGGGGACGCG GAGAGCAGAC CCTGATATCG
GGGCAGTTGA AGCCTGTCTC CCTGGTGCCG TCGCTGTTTC TGATCTACCT CCTGTCGCTC
CCATTTCTCG GTGGGGGCGT ATTTTTGCTG CCGCTTCTTT GCTACCTGGC GGTTGTCGCG
GCGGCCTCCG TCGCCGGGAG CATTTCCGGT CGCGACCTGG CGCTTTTGCC GAGGCTGTTG
CTGGTCTTTC CGACGCTGCA CCTGGTCTAC GGCGCCGGTG TCCTGCGCGG CCTGACGCGC
CCCCGTTATC GTGGGGGGAG GCAGACCCAC TGGGAAGTCG AAGTCAGGCG GGTGAAGGCA
TTCTCGGAAC CTGTAATTAA CCGTTCAACA ACGGTTCTCA ACGTTGAACG TTGA
 
Protein sequence
MRPGPAALPK FSIIIPVKPG GEVRALSGLN QATYPEELFE VLIAYGRQPS VQRNVAARDA 
KGEILYFLDD DSLVAPGFLE RAASHYREPK VAAVGGPSLT PANDSPLQRA IGTAFTSPVG
GGGVRNRYRK NGSARYSNDS ELILCNLSFR RDIFLTHEGL DERLYPNEEN ELMDRLQQEG
HLLVHDPELA IVRSQRNTYR AYVRQMYGYG RGRGEQTLIS GQLKPVSLVP SLFLIYLLSL
PFLGGGVFLL PLLCYLAVVA AASVAGSISG RDLALLPRLL LVFPTLHLVY GAGVLRGLTR
PRYRGGRQTH WEVEVRRVKA FSEPVINRST TVLNVER