Gene Gmet_3529 details

Gene Information

Locus tag: Gmet_3529
Symbol:
ID: 3739788
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter metallireducens GS-15
Kingdom: Bacteria
Replicon accession: NC_007517
Strand:
Start bp: 3962463
End bp: 3963512
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 66%
IMG OID: 637780818
Product: capsule biosynthesis protein, putative
Protein accession: YP_386459
Protein GI: 78224712
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation)
TIGRFAM ID:
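
The coordinates and lengths in the table can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the source record.

```python
# Values copied from the Gene Information table.
start_bp = 3962463
end_bp = 3963512


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bases):
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    codons, remainder = divmod(n_bases, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


n_bases = gene_length(start_bp, end_bp)
print(n_bases, protein_length(n_bases))  # prints: 1050 349
```

Under these assumptions the computed values agree with the reported Gene Length (1050 bp) and Protein Length (349 aa).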


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.198641
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 73
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCAGG TCCGGCTTAT GCTTGCCGCC ATCGTCACCC TCCTTGCCGT CCCGGCCCTG 
GGCGCCGACG GAATCGTCAT CACCTGCGTC GGCGACATCA TGCTCGCCGG CAGCGCCACA
CCGACCCTCT CCCGGTCGGG ATACGACTAC CCCTTCGCGA AGACGGCCCA GGAACTTCGG
CGGGGCGATA TTGCCATGGG GAACCTGGAG GCCCCCCTTA CGGAGCGCGG AACCGAGTAC
CGGGACAAAA CGTACCGTTT CCGGACAAAC CCCATCGCTG CAGCAGCCTT GAAGCGGGCC
GGCTTCTCGG TCCTCACCCT GGCCAACAAC CACATGATGG ATTACGGAAA TGAGGGACTC
CAGGACACCC TGGCGACCCT CTCCCGCCAC GGCATTGCCC ACACGGGCGC CGGCGCGTCA
CTGGCCGAGG CCCGCCGGGA GGCGGTGGTC TCGGTGCGGG GGAAGCGGAT CGCCTTCCTC
GCTTATTCCC TCACCTTTCC GTCGGAGTTC TATGCTGGCC CGAACCGGCC AGGCACCGCC
CCCGGCTACG CCCCCCATGT ACGGGAGGAT ATCAGGCGGG CGAAGGCGGA GGCCGACTAC
GTGGTGGTCT CGTTCCACTG GGGGGCGGAA CGGGCAGAGT TTCCGAAGCA GTACCAGACG
GAGACTGCCC GATTGGCCAT TGATGCCGGC GCCGACGCCA TCATCGGCCA CCACCCCCAC
GTGCTCCAGG GGATCGAATT CTACCGGGGA AAGCCGATTC TCTACAGCCT CGGCAACTTC
GCCTTCGGCA GCCGGAGCAC CGCCGCCGAT CGGAGCGTCA TGGCACGGCT GACCCTCTCC
GACGAAGAAA CCTCCGTGGA ACTGGTACCC CTGAACGTTC TGCACCGGGA GACCCGCTAC
CAGCCCGGCA TCCTTGCGGG GCGCAAGGGA GCGGAGGTTA TCGAGCGGCT GAACCGGCTG
TCGCAACCGT TCGGCACGGT GATTTCGGGT TCTGCGGGGC GCTTCAGGGC AAGAACATCC
GGAGCCGACC AGCGCATCGC CACCCGCTGA
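
The GC content reported in the Gene Information table can be recomputed from a sequence laid out in space-separated blocks like the one above. This is a minimal sketch, assuming GC content is the fraction of G and C bases over all bases; the helper name `gc_content` is hypothetical, and only the first row of the gene sequence is used here for illustration, so its value is not expected to match the whole-gene figure.

```python
def gc_content(seq):
    """Fraction of G/C bases in a nucleotide string, ignoring whitespace."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return gc / len(cleaned)


# First row of the gene sequence, copied verbatim (blocks of 10 bases).
first_row = "ATGCGCCAGG TCCGGCTTAT GCTTGCCGCC ATCGTCACCC TCCTTGCCGT CCCGGCCCTG"
print(f"GC content of first row: {gc_content(first_row):.1%}")
```

Passing the full gene sequence (all rows joined) to the same function would give the gene-level value to compare against the table.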
 
Protein sequence
MRQVRLMLAA IVTLLAVPAL GADGIVITCV GDIMLAGSAT PTLSRSGYDY PFAKTAQELR 
RGDIAMGNLE APLTERGTEY RDKTYRFRTN PIAAAALKRA GFSVLTLANN HMMDYGNEGL
QDTLATLSRH GIAHTGAGAS LAEARREAVV SVRGKRIAFL AYSLTFPSEF YAGPNRPGTA
PGYAPHVRED IRRAKAEADY VVVSFHWGAE RAEFPKQYQT ETARLAIDAG ADAIIGHHPH
VLQGIEFYRG KPILYSLGNF AFGSRSTAAD RSVMARLTLS DEETSVELVP LNVLHRETRY
QPGILAGRKG AEVIERLNRL SQPFGTVISG SAGRFRARTS GADQRIATR
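
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons. This is a minimal sketch, assuming the amino acid assignments of NCBI translation table 11 (which are the same as the standard code; the tables differ in permitted start codons) and ignoring ambiguous bases; the name `translate` and the table layout are illustrative choices, not taken from the source page.

```python
from itertools import product

# Amino acids in NCBI order for codons enumerated over bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a CDS up to (not including) the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as listed under "Gene sequence".
print(translate("ATGCGCCAGG TCCGGCTTAT GCTTGCCGCC"))  # prints: MRQVRLMLAA
```

The result matches the first block of the protein sequence (MRQVRLMLAA).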