Gene Gmet_1233 details


Gene Information

Locus tag: Gmet_1233
Symbol:
ID: 3738410
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter metallireducens GS-15
Kingdom: Bacteria
Replicon accession: NC_007517
Strand:
Start bp: 1390389
End bp: 1392110
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 55%
IMG OID: 637778513
Product: intermediate filament protein
Protein accession: YP_384194
Protein GI: 78222447
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
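
The listed gene and protein lengths follow from the start and end coordinates. The sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive (which is consistent with the stated 1722 bp) and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length_bp(1390389, 1392110)
print(length_bp)                     # 1722
print(protein_length_aa(length_bp))  # 573
```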


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 0.190296
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAC GTATTGTCGT GGCATTATTC CTGGCTCTTT CCTTTCTTCC GGGATGCGCC 
ACAAACGGCG CGGGAAAGCC TCTTCCTGCC AACGAGCATT CATTTCAGCC GACCGTCAAT
ATTGCCGGAT CCAGGGCTCT CTACATTTAT TCATTGTCGC GTCTCCGCGA GCTGGACGGG
GATTTCGAGG GTGCGCTCAC GCTCCTGAAT GGTGCCATTG AGGCTGACCC CAACTCCGCG
TTTCTTCATA CGGCAGCCGC CGAGATCTAT CTGAAATCGG GCAAACTCGA CGATGCGCTC
CGTGCATGTG AAAACGCCAT CCGCGTCGAT CCCGGGTTCC GTCCGGCACG GATCATTGCC
GGGACCATCC TGGCAAACCT CAAGCGGGAC AAGGAAGCTA TTGTCCATCT CAGCAAGGCC
ATTGAGCTTG ATCCGACCAA GGAGGATGCC TATCTCCACC TGGCCATCTC CTATGTCCGA
ACCTTCGATT ACGAGCAGGC GGTTAACACC CTCAAGTCCC TGATCAAGAT CAACCCCGAG
TCATCGCTGG GCTACTATTA CCTGGGCAAG ACCTACGACC AGATGAAGCT CCAGAAGGAG
GCTGCCAACT ATTACAAGAA GGCCATCGAG ATCAAGCCCG ATTTCGAGCA GGCCATAATC
GACCTGGGGA TTTCCCAGGA GGGGCTCGGC CTCTATGATG ATGCCATCGC CACCTACAAG
CGGCTGCTTG AGACAAATCC GTTCAACATG AACGTGCTGC AGCACCTGGT GCAGCTTTAC
CTCCAGCAGC AGCGGCTGGA GGACGCGCTC CCACTGCTCA TCGTGATGAA GGATCGGGGT
GTCGGCGGCC TGGAAACCCA GCGCAAGATC GGCCTCATCT ATATGGAACT GGAGCGCTAT
GACGAGGCAA TCGCCGAATT CGAGCAGATC CTTGCCCGGG AACCCAAGGC CCACCAGATT
CGCTTCTACA TTGCGAGCGC TTACGAGGAG AAAGAGGAGT TCGACAAGGC CATAGAGGAA
TTCAGTAAGA TTCCTCCGGG AACCGCCAAT TACGTTGAGG CCTTGGGGCA CATCGCTTTC
ATGTACCGGG ATCAGGAAAA ACCTGAGAAG GGAATCCAGA TCCTGACGGA TGCCATTACT
GCCAATCCCG ATAAATTGGA CCTCTATCTC TATCTGGCCG GTCTGTACGA ATCAATGGAT
AAGTTCTCAG AAGGACTTGC CGTTCTCAAG GGGGTTGAAG GGAAATTTGC CGAGGACCCG
AGGCTCCACT TCCGGATGGG AACCATCCTC GACAAAATGG GGAACAAGGA GGAGTCCATC
GCCCGGATGA AGCGGGTTAT CGCCATTACG CCCGACGATG CCCAGGCCCT CAACTATCTG
GGCTATACCT ACGCCGAGAT GGGCATCAAG CTGGATGAAG CCCTCCAGTA CCTCAAGAAG
GCCGTGGCGC TTCGTCCCAA CGACGGTTTT ATTCTCGACA GTCTCGGCTG GGTCTACTTC
AAGATGAAGC GTTATGACGA GGCCGTGCCG CTCCTGGAGC GGTCGCTCAA GGTCGTGGAG
GACGATCTGA CGGTCATGGA GCACCTGGCC GACGCCTATG CAGCCAACCA TGAATACCGC
AATGCCTTGA AACTTTACAA AAAGATCCTC GACGCCGACC CGGGTCGCAA GGATATCGCC
GAAAAGAAGA AAAAGGTCAG GGCGGAAAGT CTGGAAAAAT GA
 
Protein sequence
MKKRIVVALF LALSFLPGCA TNGAGKPLPA NEHSFQPTVN IAGSRALYIY SLSRLRELDG 
DFEGALTLLN GAIEADPNSA FLHTAAAEIY LKSGKLDDAL RACENAIRVD PGFRPARIIA
GTILANLKRD KEAIVHLSKA IELDPTKEDA YLHLAISYVR TFDYEQAVNT LKSLIKINPE
SSLGYYYLGK TYDQMKLQKE AANYYKKAIE IKPDFEQAII DLGISQEGLG LYDDAIATYK
RLLETNPFNM NVLQHLVQLY LQQQRLEDAL PLLIVMKDRG VGGLETQRKI GLIYMELERY
DEAIAEFEQI LAREPKAHQI RFYIASAYEE KEEFDKAIEE FSKIPPGTAN YVEALGHIAF
MYRDQEKPEK GIQILTDAIT ANPDKLDLYL YLAGLYESMD KFSEGLAVLK GVEGKFAEDP
RLHFRMGTIL DKMGNKEESI ARMKRVIAIT PDDAQALNYL GYTYAEMGIK LDEALQYLKK
AVALRPNDGF ILDSLGWVYF KMKRYDEAVP LLERSLKVVE DDLTVMEHLA DAYAANHEYR
NALKLYKKIL DADPGRKDIA EKKKKVRAES LEK
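
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method. It builds the codon table from the standard NCBI ordering (translation table 11 assigns the same amino acids as the standard table and differs only in which start codons are permitted), and it is applied here only to the first 30 and last 12 bases of the gene sequence. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon ordering used by NCBI (bases in T, C, A, G order).
# Table 11 encodes the same amino acids; only the allowed start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_bases = "ATGAAAAAAC GTATTGTCGT GGCATTATTC"  # start of the gene sequence
last_bases = "CTGGAAAAAT GA"                      # end of the gene sequence
print(translate(first_bases))  # MKKRIVVALF
print(translate(last_bases))   # LEK
```

Both fragments agree with the start (MKKRIVVALF) and end (LEK) of the listed protein sequence.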