Gene GM21_1589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1589 
Symbol 
ID8136920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1853087 
End bp1854805 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content62% 
IMG OID644869202 
ProductTetratricopeptide TPR_2 repeat protein 
Protein accessionYP_003021402 
Protein GI253700213 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value0.499739 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAAGA AGTGTCTAGC GCTCCTTTCC CTCACTTTGT TAATTAATGC CTGCGCCAGC 
GAGCACGCCA CCGGCACGAT CCCTTCGGCC GGACTTGAGG CTGTTGCGGC CCCGTCGGCC
GGTCCCGGAG AGGGGCGCAC CATGTATCTC TTCGCCTTGG CTCGCCTGCG TGCCGGCGAG
GGGGACCAGG ATGCGGCGCT CGCGCTTTTG CGCCAGGCGA TGGCATCCGA CCCGGGGGCG
GCCTACCTGC ACACGGCGGC GGCCCAGTAC CTGCTGCAGC AACATAAACC CGAAGAGGCG
CTGGCCGAAA GCCAGGCCGC GATCAAGATC GACCCCACTT TCCTTCAGGC GCAGCTTCTG
TCGGGGAACA TCCTGATGAC CATGCAGCGC GAGAAGGAGG CCATCCCCTA TTACAAGAAG
GTGATGGAGC TCGACCCGAC CAAGGAAGAG GTCTACCTCC ACGTCGCCAT CTACTACCTG
AAGAGTTTCG AGTACGAGCA GGCGGTCGAC ACCCTGAAGG GGTTGGTCAA GGCTGCGCCC
GACTCGGCGC TTGGCTATTA CTACCTCGCC AAGACCTACG AGCAGATGCG TCTGCCGCGC
GAGGCGCTCG GCTACTACAA GAAGGCCCTC GACTTGAAGC CGGACTTCGA GCAGGCGCTG
ATCGAGATGG GGATCTCGCA GGAGACCCAG GGGCTCATCC CCGACGCCAT CGAGAGCTAC
AAGGGGCTTC TCGATATCAA CCCCGCCAAC GCCAACGTCG TGCAGCACCT GGCGCAGCTC
TACATCCAGC AGAAGCGGCT TAGCGAGGCG CTCGCCCTGT TGCAGGAAAA GGGGGGGAAG
ACGCTTGAGA ACTCCCGCAA GATCGGGCTC TTGTTCCTGG AGCTTGAGCG CTACGACGAT
GCGGTGAAGA CCTTTCAGGA GATCCTCGAC GTAGAGCCTG CCGCCCAGCA GGTCCGCTTC
TACCTCGCCA CCGCCTACGA GGAGAAGGAG GACGCCGACC GGGCTATCGC CGAATTCCTG
AAGATCCCCA AGGAGTCTCC CTACTACCCC GACGCCGTAG GTCACTTGGC CTACCTGTAC
AAGGAGAAGG GGACCCCGGA GAAGGGGATC GCCCTTTTGA AGGAAGAGAT CAAGGATCAA
CCGGCGCGGA TCGAGCCTTA CCTGCATCTT GCCGGCCTCT ACGAATCGAT GGAGCGCTAC
AAGGAAGGGG TCGACACGCT GAACTCGATG GACGACAAGC TCAAGAACGA CCCCCGCGTC
CTGTTCCGCC TCGGCATCCT GTACGACAAA GTCGGGCAGA AAGAGCAGTC GGTCGCCATG
ATGAAGCGCG TCATCGCCGT GAACCCGAAC GACGCCAACG CCCTGAATTA CCTTGGGTAC
ACCTACGCGG AGATGGGGGT GAACCTGGAG GAGGCGCTTT CCTACCTGAA GAAGGCGGTC
GAGCTGAAGC CGGACGACGG CTTCATCCTG GACAGCCTCG GCTGGGCCTA TTACAAGCTG
AAGCGCTACA ACGAGGCGGT CGCCCAGCTG GAGCGGGCAG CGGAGCTCTC CGACCAGGAC
GCAACGGTGC TCGGCCATCT CGCCGACGCC TACTGCGCCG CGCGCGCCTA TAAGAAGGCG
CTCCAGCTGT ACCGGAAGCT GCAGAAGCTG GAGCCCGAGC AAAATGCCGA GCTCGCCGAG
AAGATCAGGC ACTGCCGCCA GGAGAGCGGG GAGAAATGA
 
Protein sequence
MTKKCLALLS LTLLINACAS EHATGTIPSA GLEAVAAPSA GPGEGRTMYL FALARLRAGE 
GDQDAALALL RQAMASDPGA AYLHTAAAQY LLQQHKPEEA LAESQAAIKI DPTFLQAQLL
SGNILMTMQR EKEAIPYYKK VMELDPTKEE VYLHVAIYYL KSFEYEQAVD TLKGLVKAAP
DSALGYYYLA KTYEQMRLPR EALGYYKKAL DLKPDFEQAL IEMGISQETQ GLIPDAIESY
KGLLDINPAN ANVVQHLAQL YIQQKRLSEA LALLQEKGGK TLENSRKIGL LFLELERYDD
AVKTFQEILD VEPAAQQVRF YLATAYEEKE DADRAIAEFL KIPKESPYYP DAVGHLAYLY
KEKGTPEKGI ALLKEEIKDQ PARIEPYLHL AGLYESMERY KEGVDTLNSM DDKLKNDPRV
LFRLGILYDK VGQKEQSVAM MKRVIAVNPN DANALNYLGY TYAEMGVNLE EALSYLKKAV
ELKPDDGFIL DSLGWAYYKL KRYNEAVAQL ERAAELSDQD ATVLGHLADA YCAARAYKKA
LQLYRKLQKL EPEQNAELAE KIRHCRQESG EK