Gene GM21_0616 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0616 
Symbol 
ID8135931 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp747631 
End bp749574 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content63% 
IMG OID644868233 
ProductTetratricopeptide TPR_2 repeat protein 
Protein accessionYP_003020448 
Protein GI253699259 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value6.1691e-34 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAAAAGAT ATGCAGCGCT ATCCTCGCTT CTTGCGGTGA CCCTGTTCTC CACCGGGTTC 
AACTGGCCGT TTCCTACGGG CAACGCCTGC CGCGACGCCA AGCGCATCAT CCTGGAACTC
CCCCCGCAGG CGGGGGAGCA GAAGAGGAAG GAAGCCGAAA AGCGGGTGGC CGAACTCTGC
CCGACCGGCC CGGCCGGACA TTACCTGAAG GGGCTCACTT TCGAGAGAAG CGGCAATGTC
GACGCCGCCA TAAGCGAGTA CCGGGAGACC CTCTCCCTCG ACCCCGAGTT CTACCCCGCC
AGCGGAAACC TGGGCCTTTT GCACCTGCAG AAGGGGGGCG GCGAGGAAGC TGCCGTGGAG
CTTGCCGCGG GACTTAAGGC CGGGGACCCC CGCTACCACG CAGGGCTTGC CCGGGTCATG
GCGGACAAGC AGATGCACCT GCTCGCCATC TTCCACTACA ACGAGGCGAT TGCCGCTTTC
CCCGACGACG CTGCGCTTTA CACCGGCGTC GCGGCATCCT ACAACGCGGC GGGGCAGAAA
CAGAAGGCCG AGGACGCCTA CCGCAGGGCC ATGGTCTTGC AGCCCGACAA CGCGCAGGCC
CGCTTTGGCC TTGGCGCCCT TCTCCTGGAG CGTGGCGAGG TCGACAAGGC GGTCGGCGAG
TTGAAGCTCG CGGCCATCGC CCAGCCGGCC AACAGGGAGA CACACCGGCT CTTAGCCGAG
GCATACGCCC GCAAGGGTGA CGCGAAGAGC GCCCACTACG AGCGCGGGCT CGCCGGCATC
GGCACGAAGC TGAAGGAGCT CCCCAAGGTC GACCACATGG CGCTGGCTGA AAAACACCGC
CTGGCCAAAG ACCACGAGAT GGCGATCAGC GAGTACCGGA TGCGGCTGGC CGAGGAGCCC
GACGACGCGC TGGCCCAGCA GCGCCTGGGG GACACCCTGC TCGCGGTGGG ACGCGAGGAC
GAGGCGATGT CGTACTACCG CGACGCGCTG AGAAACAAGG CGGAAACCCC CGAGCTCCAT
TTCAACCTGG CCGGGATCTA CGAACGCAAG GCGCTTCTGG ACGAGGCGGT GGTCGAGTAC
CGTCAGGTAC TGGCATCGAA CCCCGACAAC CAGCATGCGC GGCAGCGCCT GGCCGATATC
TACACGCTGC GCGGCAGCTT CAATCAGGCC CTCGAACAGT ACCAGGCGCT CATCAAGACG
AACCCCGCCG ACCCGGCGCT GCAGCTGAAA CTCGCGCGCG CCTACGTCAA CAGCAAGGAG
CTTGACGCCG CGGCCGAGGC CTACCAGGCA GCCCTCAAGC TGGACGGCGA GTCGGTGGAC
GCCCACCGCG AGCTCGCCAA CCTGCAGAGA AAAAGAAACC TGATGGACGA GGCGGCCGCC
GAATACCAGG AAGTGCTCAG GCTGAAAAAG GACGACCAGG AAGTCCGCAC CGCCCTCACC
GCCATCTACG TGAAGAACAA GAACTACGAC GCCCTGGCTC AGCTCCTGAA GGACGGAGTG
GAGCTCTCCC CGAACGACCC CAACGCGCAC TACAAGCTGG GACTGGTCTA CGAATTCCAG
AAGGATTACA CCGCGGCCAC CGCCCAGTAC AAGGAAGCGG TGACCCTGAA GCCCGACCAT
GCCAAGGCCT TGAACGCCAT GGGACGGGTC CAGATGAAGG ACGGCCACCT CGCCGAGGCA
AAGGAGTCGC TCGAAGCGGC GAGGAAGGCG GACCCCGACC TGGAAGAAGC CCAGGTCCTT
TTGAGCAACA TCAAGGACGA GTTCACGCCC GAGCCCAGGA GTTACCGAAA GCACAAGTCC
TCCAACGGGA GCAAGGCCAA GAAAGGGAAG AAGGGGAAAA AAGGGAAGGA AGCGAAGAAA
TCCAAGAAGA AGAACAGTGA CGACAAGCCT GCCAAGAAGT CGAAGAAGAA GAAAAAGTCC
AAGAAGAAGA GTAAGGAAGA CTAA
 
Protein sequence
MKRYAALSSL LAVTLFSTGF NWPFPTGNAC RDAKRIILEL PPQAGEQKRK EAEKRVAELC 
PTGPAGHYLK GLTFERSGNV DAAISEYRET LSLDPEFYPA SGNLGLLHLQ KGGGEEAAVE
LAAGLKAGDP RYHAGLARVM ADKQMHLLAI FHYNEAIAAF PDDAALYTGV AASYNAAGQK
QKAEDAYRRA MVLQPDNAQA RFGLGALLLE RGEVDKAVGE LKLAAIAQPA NRETHRLLAE
AYARKGDAKS AHYERGLAGI GTKLKELPKV DHMALAEKHR LAKDHEMAIS EYRMRLAEEP
DDALAQQRLG DTLLAVGRED EAMSYYRDAL RNKAETPELH FNLAGIYERK ALLDEAVVEY
RQVLASNPDN QHARQRLADI YTLRGSFNQA LEQYQALIKT NPADPALQLK LARAYVNSKE
LDAAAEAYQA ALKLDGESVD AHRELANLQR KRNLMDEAAA EYQEVLRLKK DDQEVRTALT
AIYVKNKNYD ALAQLLKDGV ELSPNDPNAH YKLGLVYEFQ KDYTAATAQY KEAVTLKPDH
AKALNAMGRV QMKDGHLAEA KESLEAARKA DPDLEEAQVL LSNIKDEFTP EPRSYRKHKS
SNGSKAKKGK KGKKGKEAKK SKKKNSDDKP AKKSKKKKKS KKKSKED