Gene GM21_3926 details


Gene Information

Locus tag: GM21_3926
Symbol:
ID: 8139300
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4510412
End bp: 4512496
Gene Length: 2085 bp
Protein Length: 694 aa
Translation table: 11
GC content: 69%
IMG OID: 644871543
Product: Tetratricopeptide TPR_4
Protein accession: YP_003023701
Protein GI: 253702512
COG category:
COG ID:
TIGRFAM ID:
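
The start and end coordinates, gene length, and protein length listed above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4510412, 4512496)
print(length)                     # 2085
print(protein_length_aa(length))  # 694
```

Both values agree with the Gene Length and Protein Length fields of this record.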


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 62
Fosmid unclonability p-value: 0.0916918
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTACA CGTTGCGCAC ACCCCGCCCA GGGCTCGCCC TGCTGCTCCT GCTCGTTGTC 
GCCTGCGCGC TCCCCGGCCG CGCCCAGGCG CAGCAGGATA ACCGCCTGCG CCGTATCGCC
GTCCGTCCCC ACCAGGAATT CACCAGGATC GACCTCTTCT TCCAATCCCC CCCGGATTAC
TCGCTCAGGC TCTCCCCGGG ACGGGTCCGC TTAGAGATCA GGGGGGCCGA CGCCCCGAGC
TTCAGGAAGC TGCGCGGCTA CGCGGACGCC AGGCTCTCCG GCATCTTCAC CGAGCTGCGG
GCAGGCGTCT TGAGCGTCTC CATTCCGGTG CGCGAGTCGG AGCCGGGGGT GCAGGCGGTT
TCCTACGGTA ACCCTTCGGT CCTCTCGCTC GACATCGGGC CGGGGGTGAA GCGTGCAGGA
AGGGTCGACA TCGCGCCGGG GCGCGAGCCG ATCCTCGCCG GGACCGAACA GTTCGTGCGC
GCCTTCGAAT CGGAGCCGGG CGGGCTCCCC TTCGCCCCCA CCGACGCGAA GCTTTTGAAG
GAGCTCTTCG CCCCCGAGGA GCTGGCGCTC TTCCAGCAGG GGGAGAGCCT CCTATACGAG
GAGCAGGCGG AGGTCGCGGC AGGGGTCTTC GCCACCTTCC TGGGTAAGCC CCGGGGGGCG
CGGGCGCTCG CCTTTTACCG GCTCGCAGAG GCGCTCTTGG GGCTTGGGCG CGTCAAGGAG
GCGCTGGACG CCTTCAAGCA GGGGGAGGCG GCCTGGCCGC AGTACCTGGA GCAGGCAGCC
GACATCCAGC AATCCTACGC CGACGCCCTG GCCCGAAGCG GCGACTTCGC CGCCGGGCGC
CGCAAGCTCC TGCAGCTCAT GGACCGCTAC GTCGGCACCC CCTACCAGGC CGAGCTGATG
AGCCGCCTGG CGGACCTGTC GCAGCGGGAG GGAAGAAAGC TTGCGGCGGC GGCTTTGTAC
CGGAGCGTCG TCGTGCACGC GCCGGGAAGC GCCGCCGCGG CAAGGGCGCG CCTGAAGCTT
GCCGACCGGG AACTCTTCAC CCTTTCCCGT GACCGCTACC GGGAACTTTT GGCCAAGTAC
CAGGGGGTCT ACCAGGCGCC CGGCGATTTC GCCACGAGGG ACGAGGCGCT CTTCAAGACG
GCGCTGCTAC TGGCCCTTTA CGCCCCCCCC CGGGAGGCGC TGGAGGCCGT CATCAGCTAC
GACCGGCGCT ACCCGCGCGG CATCTTCAGC ACCATCGTCA AACAGATGCG CGAGGAGCTC
TTGCTCCCCA GCTACCGCGA GCTCGCCGCT GCGGGAAAGG ACGAGGCCCT GGTGCAACTC
GCGCGGGAGA ACCGGGAGCA TCTCGCCAGG TGCTTCAGCG ACCCCGCTTT CCCCGAGAGG
CTCTCCCAGG CCTTCGAACG GACCGGGAAG CTGACCCAGG AGATGGAGCT TTTCGGTTAC
CTGGACGAGA AGAACTGGGC CGCGGCGGGC GCCCCCTTCA TGCTCTCCCG CGTGGTCGAC
GACGCGGTCA CCTTGGGCAA CCTGGCGCAG GCGGAGGAGG CGGGGCGCAG GTTCCTGCAG
CGTTTCCCGG CCGACGCGCG CGCCGCGAGG GTCAAGGAGC AACTGGGGCG GATCGCCTTC
GAAAAGGGGG ACCTGCCGGC GGCGGGCGCG GAGCTTCGCT TCCTCGCGGC CAAGGGGGCC
CGGGCGCAGT TTCCCGACAG CGACTATTAT CTCGGCAAGG CGCAGCTGGC CGCCGGCGAC
CACAGGGGGG CGGTGCGGAG CCTCGCCCGG TTCACCCAGA ACGTGAGGGA GGACTCGGCG
CTTCTTCCCG ACGGCTACTT CACCCTGGCC GGGGCGCTTG CCGCCTTCAA GGATTACGAC
CGGGCGCTTG CCGCCTGCGA GGTCGGGGGG AGGGTCGTCG CCGGGGAGGG GGTCGCCCAG
TTCAAGTACA AGGCGGGCGA ACTCTACCTG CAGCAGGGGG AGGTGGCCAA GGCTCAAGCG
AGCTGGGAGA AGGCGGCGGC TTTAGGGGGG ACCTGGGGGA AACTCGCCGG CGAGGCGCTC
GCGGACCTCA AATGGCGCAT GAAGATCTCC AAGGAGCTTC CCTGA
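
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before any computation. As a rough sketch, the GC content field could be recomputed as shown below. The helper names are hypothetical, and only the first line of the sequence is used here for brevity; the whole sequence would have to be pasted in to compare against the GC content reported for this gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence only (not the full gene).
first_line = "ATGACCTACA CGTTGCGCAC ACCCCGCCCA GGGCTCGCCC TGCTGCTCCT GCTCGTTGTC"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```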
 
Protein sequence
MTYTLRTPRP GLALLLLLVV ACALPGRAQA QQDNRLRRIA VRPHQEFTRI DLFFQSPPDY 
SLRLSPGRVR LEIRGADAPS FRKLRGYADA RLSGIFTELR AGVLSVSIPV RESEPGVQAV
SYGNPSVLSL DIGPGVKRAG RVDIAPGREP ILAGTEQFVR AFESEPGGLP FAPTDAKLLK
ELFAPEELAL FQQGESLLYE EQAEVAAGVF ATFLGKPRGA RALAFYRLAE ALLGLGRVKE
ALDAFKQGEA AWPQYLEQAA DIQQSYADAL ARSGDFAAGR RKLLQLMDRY VGTPYQAELM
SRLADLSQRE GRKLAAAALY RSVVVHAPGS AAAARARLKL ADRELFTLSR DRYRELLAKY
QGVYQAPGDF ATRDEALFKT ALLLALYAPP REALEAVISY DRRYPRGIFS TIVKQMREEL
LLPSYRELAA AGKDEALVQL ARENREHLAR CFSDPAFPER LSQAFERTGK LTQEMELFGY
LDEKNWAAAG APFMLSRVVD DAVTLGNLAQ AEEAGRRFLQ RFPADARAAR VKEQLGRIAF
EKGDLPAAGA ELRFLAAKGA RAQFPDSDYY LGKAQLAAGD HRGAVRSLAR FTQNVREDSA
LLPDGYFTLA GALAAFKDYD RALAACEVGG RVVAGEGVAQ FKYKAGELYL QQGEVAKAQA
SWEKAAALGG TWGKLAGEAL ADLKWRMKIS KELP
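
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the database's own pipeline. It builds the codon table from the compact NCBI layout (bases ordered T, C, A, G) and assumes that translation table 11 assigns the same amino acids as the standard code, differing only in which start codons it allows. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in NCBI order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGACCTACA CGTTGCGCAC ACCCCGCCCA"))  # MTYTLRTPRP
```

The output matches the first ten residues of the protein sequence above. This gene starts with ATG, so the sketch does not handle the alternative start codons permitted by table 11.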