Gene GM21_3824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3824 
Symbol 
ID8139198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4404243 
End bp4407563 
Gene Length3321 bp 
Protein Length1106 aa 
Translation table11 
GC content70% 
IMG OID644871441 
ProductTetratricopeptide TPR_2 repeat protein 
Protein accessionYP_003023599 
Protein GI253702410 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value0.138221 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGCGC CCGAGAGCGC CCGAGAGCTT TTCCTGGCGG GGAACGCGCT TTTCGGCGCG 
GGGGACCTCT CCGGAGCGGC CGAGTGCTAC CGGCGCGCCC TGCAGCTCGA TCCCGGCTAC
GCCGAGGCGT GCTTCAACCT GGGGTGCGCC CTGGACCGCC TGTCCGGCCC CGCAGAGGCG
CTGCCCCACC TCGCGCGCGC CGCGGAGCTA TCGCCCGAGT GGAGCCGGGC CCGCGGCAGC
CTGGGTTTCG CCCTGGCCCG CCTCGGGCGC ATGGGAGAGG CGGCGGGGGA GCTCGCCGCG
GCCGTCCGGC TCGATCCCGG CGACCCCGGT CTCTCGAACA ACCTGGGGCT CGCCCTCTCG
GCGCTTTCCC GCGGCGAGGA GGCGAGGGAT GCATTCGAGG AGGCGATCCG CCTCGACCCG
CTCTACGCCG AGCCCCACAA CAACCTCTCC ATCCTCTTCG AGCGTTTCGG CGAGAGCGCC
CACGCCATAG CGGCGGCACT TGAGGCGCTC CGGCTGAAGC CGGAATTCCC CGAGGCGCAC
CTGAACCTCG CCAACGCCCT CAAGTCGCAG GGGCGGCACC AGGAGGCGAT CGCCCACTAC
CGGGAGGCGC TGAGACTTCG TCCCGACTAC CGCGAGGCGG AAAGCTCGCT GCTCTTCGCG
CTCCTTTACC CCGCGCACAC CCCCGAGGAG GAGCTCTTCG CCGAGCACGC AGCCTTCGGG
GCACGCTGCC GTTTCTCAGC ACCCAGGCAC GTGAACGACC CGGACCCGGA GCGCCCTCTG
AAGCTGGGTT ATCTCTCCGC CGACTTCCGG GAGCATGCCG TGGCCCGCTT CATCGAGCCG
GTGCTGGCCC ACCACGACCG CTCCCGGTTC CGCATCTATT GCTACTCGAA CGTCTCGGCC
CCCGACCAAA GAAGCGAGAG GCTCGCGGCT CTCGCCGACT GCTTCCGGAG CATCGCCGGG
ATGACGGACC AAAAGGTCGA GGAGCTGGTG CGCGCGGACG GGATCGACAT CCTGGTCGAC
CTCTCCGGGC ACAGCGCGGG AAACCGCCTC CCGGTCTTCG CCCGCAGGCC CGCCCCGGTG
CAGGTCACCT GGCTCGGCTA CCCCTTCAGC ACCGGGCTGG AGGCGATCGA CTATCGCATC
ACCGACCCGG TCTGCGACCC CCCGGGCGAG ACCGAGCGCT ACCACAGCGA GGAGCTCTTG
CGGCTCCCCG GGACCTTCTC CTGCTTTCTT CCCCCCGATG ACGCGCCCCC GCCGGTGGGC
GCACCGCTTT CAAAAAACGG CAGGGTCACC TTCGGCTCCT TCAACAACCC GGCGAAGATC
ACCCCCGAGA CGGTGCTCCT TTGGTCCGGG GTGCTGCGCG CGGTCCCGGG GTCGCATCTC
CTCTTGAAGG GGTATTCGCT CGCCTGCGCC GAGACGAGGC TCCGCCTTGA GGAGGCCTTC
GCCGGGCACG GCATCGAGCG GGAGCGGCTG GAGCTTATGG GGAACACCCC CAGCTACCGC
GACCACCTGG CGCTCTACGA TCGGGTCGAC ATCGCCCTGG ACAGCTACCC CTACAACGGC
ACCACCACGA GCTGCGAGGC GCTCTGGATG GGGGTCCCCG TGGTGACGCT GGCGGGCTCC
TCGCACCGCT CGCGGGTGGG CGCCTCTCTT TTGCAGGCGC TGGGGCTTGA GGGGCTGGTG
GCGCACGAGG CGCGGAAGTT CGTGGCGCTC GCCGCTGCTC TGGCCGGGGA TCCGGAGAGG
CTCTCAGGCC TCAGAAGCAC GCTGCGCCGG ACCATGGCCG CCTCCCCCCT CACCGACGGC
GCCTCCTTCA CCCGTCACCT GGAAAAGGCC TGGCGCGACG TCTGGGCGAG GTGGTGCCGC
AGCCACCCGG CCCAGGCGCC GGACCCCGCG GTGCAGGGGG CGCAATACCT GCAGCACGGC
AGGCTCGACC GGGCGCTCTC GCAATTCCTG ATACCTTTGC GCGGCGGGGA GAGGAGCACC
CTCGGGGGGA TCCAGGAGGC GCTGCGCCTG CAGCTCGCGG CGGACCAGGC GCGCGCGCTG
GCTCTCGACG ACCCGCTGGC CTTGCGCGAG GAGGAGACGG AGCATTTGGG CTGCGAGACC
CTGGCCGAGA CGGCCGAGCT CCTGGTTGCC GCCGGCTTCG TGACGCCGGC AGAGCTCATC
TGCCGCTACC TGGGCGACCG CGGCTACCTG AGCCCCCGGG TGAGCCGTAC CCTCGCCGAG
GTGGCGCTCG CCATAGGGAA GCCCGAGGTC GCGGTGCGCG AGTTCGAACG CGCCCAGGCC
GCGGGGGACC GCTCCCGCGC CACCCGCATC AAGCTGGTGA AGGCGCAGGA GGCTGAGCGG
CTCTCCCCCC CTCCGGCGCG GGTAGAGCGT TTCCTTCTCA TCAAGGCCTG GGGGTACGGC
TTCTGGAGCG ACGTGAACAT GCTTTTGGGG CAGTGCCTTT TGGCGGAGAT CACCGGGCGG
GTCCCGGTGG TGCACTGGGG GGGGAACTCC CTTTTCTCCG ACGATCCCGG GCGAAACGCC
TTTTCGAGCT TCTTCCTCCC CTTCAACGGC ACCGGCATCG GCGAGCTCGC CGCCAAGGCG
CGTAGCATCT ATCCCCCCAA GTGGAACCGG GAGAACCTCC TTTTGGACGA GCTCAACAAG
GAGGAGGGGC CCTGGTCCCG CTTTTCCTCC CTCTACGCCC TGGAGCGCGG CGAGGAGGTG
GTGGTCGGGG ACTTCCATTA CGGCGTGAAC GATTTCATCC CCTGGATCCC GCCGGGGCAC
CCTCTCTACG GGCTCGACCC GGACGCGCTC CACCTGCTGC TTTACCGGCG CTACCTCAGG
CCGCGCCCTG AGCTGGAGCT GCGCGCCGAG ACCTTCTTCG ACCGGGAATT CTCGGGCCGC
CCCGTGCTGG CGCTCCACGT GCGCGGGGGG GACAAGGGGG GGGAGGATCC CGGCCTTCAC
CGGCTGAACG CCCTCTACCA CCCGCGGATC GAGCGCTTCC TGGGCGAGGA GCGGGAAGGG
CGCCTTTTCC TTCTCACCGA CGACGACAAC CTCTTGGCCT CGTACCGGGA GCGCTACGGC
GACCGCCTCT CCCACACCGC CTCGACCCGC ACCGGCTCCA GCCTCGGGGT GCATCACCAG
GAACAGGCGG ACCGCAGGGC GCTGGGGGAG GAGGTGCTGG TCGACGCGCT GATCGCCGCG
CGCTGCCGCC TTTTCCTCGG CAACGGCTTT TCCAACGTCT CCCTCGCGGT GGCCCAGATG
AGGCGCTGGG AGCGGGGGAG CTGCGTCCTT TTCGGCGCCC GGCTGGACCG GGTCCGGCAG
ATGACCCTCT ACAGGAGCTG A
 
Protein sequence
MTAPESAREL FLAGNALFGA GDLSGAAECY RRALQLDPGY AEACFNLGCA LDRLSGPAEA 
LPHLARAAEL SPEWSRARGS LGFALARLGR MGEAAGELAA AVRLDPGDPG LSNNLGLALS
ALSRGEEARD AFEEAIRLDP LYAEPHNNLS ILFERFGESA HAIAAALEAL RLKPEFPEAH
LNLANALKSQ GRHQEAIAHY REALRLRPDY REAESSLLFA LLYPAHTPEE ELFAEHAAFG
ARCRFSAPRH VNDPDPERPL KLGYLSADFR EHAVARFIEP VLAHHDRSRF RIYCYSNVSA
PDQRSERLAA LADCFRSIAG MTDQKVEELV RADGIDILVD LSGHSAGNRL PVFARRPAPV
QVTWLGYPFS TGLEAIDYRI TDPVCDPPGE TERYHSEELL RLPGTFSCFL PPDDAPPPVG
APLSKNGRVT FGSFNNPAKI TPETVLLWSG VLRAVPGSHL LLKGYSLACA ETRLRLEEAF
AGHGIERERL ELMGNTPSYR DHLALYDRVD IALDSYPYNG TTTSCEALWM GVPVVTLAGS
SHRSRVGASL LQALGLEGLV AHEARKFVAL AAALAGDPER LSGLRSTLRR TMAASPLTDG
ASFTRHLEKA WRDVWARWCR SHPAQAPDPA VQGAQYLQHG RLDRALSQFL IPLRGGERST
LGGIQEALRL QLAADQARAL ALDDPLALRE EETEHLGCET LAETAELLVA AGFVTPAELI
CRYLGDRGYL SPRVSRTLAE VALAIGKPEV AVREFERAQA AGDRSRATRI KLVKAQEAER
LSPPPARVER FLLIKAWGYG FWSDVNMLLG QCLLAEITGR VPVVHWGGNS LFSDDPGRNA
FSSFFLPFNG TGIGELAAKA RSIYPPKWNR ENLLLDELNK EEGPWSRFSS LYALERGEEV
VVGDFHYGVN DFIPWIPPGH PLYGLDPDAL HLLLYRRYLR PRPELELRAE TFFDREFSGR
PVLALHVRGG DKGGEDPGLH RLNALYHPRI ERFLGEEREG RLFLLTDDDN LLASYRERYG
DRLSHTASTR TGSSLGVHHQ EQADRRALGE EVLVDALIAA RCRLFLGNGF SNVSLAVAQM
RRWERGSCVL FGARLDRVRQ MTLYRS