Gene GM21_3814 details

Gene Information

Locus tag: GM21_3814
Symbol:
ID: 8139188
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4385029
End bp: 4386876
Gene Length: 1848 bp
Protein Length: 615 aa
Translation table: 11
GC content: 66%
IMG OID: 644871433
Product: TPR repeat-containing protein
Protein accession: YP_003023591
Protein GI: 253702402
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4235] Cytochrome c biogenesis factor
TIGRFAM ID:
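
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon that is not translated.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the Gene Information table.
print(cds_lengths(4385029, 4386876))  # (1848, 615)
```

The result matches the listed Gene Length (1848 bp) and Protein Length (615 aa).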


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 97
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAGC TAGTAACCCG TCCCAGCGGC GCTGTCGCGG TCCTGCAAGA AGGGGCGCTT 
CTTTCCGCCA TCCTGATCAC CAGCAACAGG GAATCGCACC TCAGGGAATG CCTGTGCGAA
CTGATGCAGC AGAGCATCTC CGACCGCATG GAGGTGATCG TGGTCGATCA GGGGTCGGAG
CAGTGCGAAT GGGCCGTGGT GGCCGACCTG CAGAAGATCC ACCCGAACCT GATCTCCCTG
AAGCTTCCGG CGGCCGCCGG CGGCAAGGGG GTCGAGATGG CGCTCAGAAT CGCCTCGGGC
AAGTACGCGA CACTTTTGGA GGCGACCGAC CGCTTGAAGC GCGACGCCTA TGGACTCTTG
ACCGCGGCCC TGGAGGGAAA CCCCGCCGCG ATGCTCGCCT ACGGCGACAC CTGCTTCACC
GCCATCCCCC ACGAGAGCTT CGCCAGCCAC ACGAGCTACG GCAAGGTGAT CTGGCCCGAT
TACACCCCCC AGCAGCTGGC CCAGCTCTCC GAGGTGGCCC CGCACCCGGT CTGGCGCAGG
GAACTGCACG ACAGCGTCGG TTTCCCGCCG CAGGGGTGCC CGAACCACGG GGTGCGCGAG
TTCATGCTCA AGGTGGTGGA GCGCTTCCGC ATCCTGCACC TGGAGGAGTT CACCGGCCTC
AAGCTGATCA CCGCAAACCA GGCGCCGGTC CAGGCGGCTC AACCGCAGGC GCGCCCCGAG
CCCGCCCCGG CGGTCCATCG CGCTCCCGAG CAGGAACCCG CCCCCCCCGC CACCCCCGTC
AGCTTCGAAC AGCCGAGCGC GCCCGCAGCA CCGGTTTACA GCCAGAGCGC GCCGGCGCCC
TCGGTCCAAG CCGTGCCGCG GCAGAAAACG GCGAACTCCG AACTGAAAGG GGCGGACCAG
GTGTACCAGG AACTGCGCCC CATCGTCACA GGAGAAGACC CGCAGCGGGC CGCGGCGGCG
CTCCGTGAGC ACCTGGCGCG TTTTCCCAAG CACGCCGTGG CCCACAACGA CCTGGCAGCC
ATCAGCTATC AACTGGGCGA AAAGGAACAG GCGCTCAAGC ATTACCGCGA GGCGGTCTGG
CTCGATCCTA AAGAAAACGT CTACCTGAAG AACCTGGCCG ACATCCTTTT CGTCGAGGCG
GGAGAGGCCG ACGAGGCGAT AGCGATCTAT CTGAGGCTCC TGGAGCAGTC GCCGCGCGAC
GTCGAGACCC TGCTGAACCT CGGGATCATC TGCGAGAGCG TGGGGCAGCC CGCCGAGGCC
GAATCCTTCT ACCAGAGGGC GCTGGAGATC GAGCCTTGGA ACCAGGCCGC ACGGCAGCAA
CTGACCGCGC TGCGCCAGAG GACGGAAGAG CCCCAGCCCC CGGCTGCAAA AGACGAGGAT
CTCGCCGCAG AGGATCGGTA CCAGAGGTCC CAGGAACTGG TCTCCCAGGG GGACCTGGAC
GGGGCGTTCC AGGAACTGAA AGAAATCCTC CTCTCTTACC CCGACTTTGC CCCCGCGCAC
AACGACCTGG CCGTTTTGGC CTACCAGCAG GGGGACAAGG AGCAGGCGCG CGCGCACTAC
GAGAAGGCGG CGGAGCTTGC GCCTGGAAAC GGCACCTTCC AGAAGAACCT GGCCGACTTC
TACTTCGTCG AAGGGTACGA CGTCGACGGG GCCATCGCGA TCTACCTGGA ACAGCTCCGC
AGGGAGCCCA AGAACATCGA GACGCTGATG GGGCTTGGGA AGATCTGCAC CATACTGGAC
CGCCCGGTAG AGGCGCAGAG CTTCTACGGC AAGGTGATCA ACCTGGAGCC GTGGAACCGC
GACGCCCGCG AATGCCTCAA CAGCCTGAAG GAGGTGGCGA ACGGCTGA
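
The GC content reported in the Gene Information table can be recomputed from the gene sequence. This is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks used above; the helper name `gc_percent` is hypothetical.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces and line breaks between 10-base blocks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence; paste all rows to check the full gene.
first_row = "ATGAGCCAGC TAGTAACCCG TCCCAGCGGC GCTGTCGCGG TCCTGCAAGA AGGGGCGCTT"
print(round(gc_percent(first_row), 1))
```

Applied to the full 1848 bp sequence, the rounded result can be compared with the GC content field.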
 
Protein sequence
MSQLVTRPSG AVAVLQEGAL LSAILITSNR ESHLRECLCE LMQQSISDRM EVIVVDQGSE 
QCEWAVVADL QKIHPNLISL KLPAAAGGKG VEMALRIASG KYATLLEATD RLKRDAYGLL
TAALEGNPAA MLAYGDTCFT AIPHESFASH TSYGKVIWPD YTPQQLAQLS EVAPHPVWRR
ELHDSVGFPP QGCPNHGVRE FMLKVVERFR ILHLEEFTGL KLITANQAPV QAAQPQARPE
PAPAVHRAPE QEPAPPATPV SFEQPSAPAA PVYSQSAPAP SVQAVPRQKT ANSELKGADQ
VYQELRPIVT GEDPQRAAAA LREHLARFPK HAVAHNDLAA ISYQLGEKEQ ALKHYREAVW
LDPKENVYLK NLADILFVEA GEADEAIAIY LRLLEQSPRD VETLLNLGII CESVGQPAEA
ESFYQRALEI EPWNQAARQQ LTALRQRTEE PQPPAAKDED LAAEDRYQRS QELVSQGDLD
GAFQELKEIL LSYPDFAPAH NDLAVLAYQQ GDKEQARAHY EKAAELAPGN GTFQKNLADF
YFVEGYDVDG AIAIYLEQLR REPKNIETLM GLGKICTILD RPVEAQSFYG KVINLEPWNR
DARECLNSLK EVANG
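
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes the gene sequence is given on the coding strand, as printed above, and uses the amino acid assignments of translation table 11, which are the same as those of the standard code (table 11 differs only in its permitted start codons, and this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the amino acids of
# NCBI translation table 11 (identical to the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence up to the first stop codon.

    Whitespace is ignored, a trailing partial codon is skipped, and
    codons containing unexpected characters are reported as 'X'.
    """
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE.get(bases[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence.
print(translate("ATGAGCCAGC TAGTAACCCG T"))  # MSQLVTR
```

The output agrees with the first seven residues of the protein sequence (MSQLVTR).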