Gene GM21_3100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3100 
Symbol 
ID8138450 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3590933 
End bp3591958 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content63% 
IMG OID644870704 
ProductPpiC-type peptidyl-prolyl cis-trans isomerase 
Protein accessionYP_003022886 
Protein GI253701697 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0760] Parvulin-like peptidyl-prolyl isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones137 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATTACC ATCACGCAGC ACCGAAGTCG CTTTTGGTTC TTTGTGCCGT CACCATGATG 
CTTTCAGGAT GCGCCGGCGC GAGGCCGACG GCTACCGCGC CGCCGGCGCC TGCGACGCCG
CCGGCCGCCC AGGCCGCGCC GGCCGCCGCC GGCGAGGTGG TCGCCAGGGT GAACGGCAAG
GAGATTCACC GAAACGAGCT GGAGCGGAGC AAGAAGATCC TGATGGCCGG ACAGCCGGGG
ATCCCCCCCT ATCTGCTGAA GGAGCTGGAA AAGCAGGCCC TGGATCAGCT GGTCGGCGCG
GAACTCATGT ACCAGGCGGG CCTCGGGCTG CAGATCAAGG ACCTGGACCG GATGGCGGAC
GCGAAGCTCG TCCAGATAAA GTCGGGTTTC AAGGATCAGC AGGTGTACGA AAAGGAGCTG
GCCAACATCG GCATGACCGA GCAGATGCTG CGGGAGTACT CGCGGCGCGA CCTGGTGATC
GCGAACCTGG TCAACACCAA GCTCGCCGCC GACCTGCAGG TCACCGATCT GGAGATCGAG
AAGTTCTATG CCGACAACCC GGAACGGTTC GAGCAAAAGG AGCAGGTCAG GGCGAGCCAT
ATCCTGATCG GCTGCGACTC GAAGGGCACC GCCGAGGAGA AGAAGAAGGC CCGGGACAAG
GCTGAGAGGC TCCTCAAGGA GGTGAAGGAG GGGGCTGACT TCGCGAAGCT TGCCCGTGAA
AACTCCACCT GCCCGAGCGC CACCAACGGC GGCGACCTCG GTTACTTCCC CAGGGGAAAG
ATGGTTCCCC CCTTCGAAGA GGCCGCTTTC GCCTTGAAAA GCGGAGAGGT GAGCGACGTG
GTGGAGACCG GCTTCGGCTT CCACCTGGTG AAGCAGACCG ACCGCATCAA GGCTGAAAAG
GTCTCGCTCG CCACGGCCAG GGAGAAGATC GTCGCCTACC TGAAGAGCCA GAAGACGGGC
GAGGTGGTTG CTTCGTTCAT CGGCCGCGCC AAGCAGGATG CGAAGATCGA ACTGCTCTTG
AAGTAA
 
Protein sequence
MHYHHAAPKS LLVLCAVTMM LSGCAGARPT ATAPPAPATP PAAQAAPAAA GEVVARVNGK 
EIHRNELERS KKILMAGQPG IPPYLLKELE KQALDQLVGA ELMYQAGLGL QIKDLDRMAD
AKLVQIKSGF KDQQVYEKEL ANIGMTEQML REYSRRDLVI ANLVNTKLAA DLQVTDLEIE
KFYADNPERF EQKEQVRASH ILIGCDSKGT AEEKKKARDK AERLLKEVKE GADFAKLARE
NSTCPSATNG GDLGYFPRGK MVPPFEEAAF ALKSGEVSDV VETGFGFHLV KQTDRIKAEK
VSLATAREKI VAYLKSQKTG EVVASFIGRA KQDAKIELLL K