Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gmet_2032 |
Symbol | |
ID | 3739812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter metallireducens GS-15 |
Kingdom | Bacteria |
Replicon accession | NC_007517 |
Strand | - |
Start bp | 2274011 |
End bp | 2276662 |
Gene Length | 2652 bp |
Protein Length | 883 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637779326 |
Product | TPR repeat-containing protein |
Protein accession | YP_384986 |
Protein GI | 78223239 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | [TIGR02917] putative PEP-CTERM system TPR-repeat lipoprotein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.192732 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.125101 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTTAACA GACGTATTGC ACTGATTTGT CTTATTGTGG CGACTCTTTC CGCTTGTGGG GGGAAAACGA AGGAAGAGCT CTACGCCGAG GCAGTTAAGG AACTGGACAA GGGCAACGCC AATGGAGCGA TTGTTCTTCT GAAAAATGCC GTTGAAAAAG ATCAAAATTA TTTCGACGCC CGTTACAAGC TTGCGAAGGC CTACATGACA GTGGGCAAAT TCGAACAGGC GGAAAAAGAG TTCCAGAAAG CACTCCGGCA AAATCCGTCC AATCCGGAGA TAAGGCTTGA TCTGGCAAAA CTCTACAATT CAATCAACAA GCCTGACGAG TCAATAGCTG AGGCCAAGGC GTATCTCTCC GCACGGGCCG GATCGGCGGA TGCCCTGGAA GTGATTGGCA CGAGCTACGG CCAAAAGAAG ATGTTCGATG AGGCCGAGAA ATACTTGAAG GAGTCTCTCC AGGCTGAACC TGCCCGTGCT TCGGCAATGC TGCAATTGGC AAAGGTTTAT CTGGCGACGA AGCGGGAGCA GGAAGGGATG GGGCTCCTCA ACGAGATCGT TCGGAAAGAC CCTAAAAACA CCAAAGCATA CTATCTTGCC GCCTTCTATG AAGGATATCG CGGCAACAGT GAAAAGGCCC TTGAATACTA CCAAAAGATC ATGCAGGCGG ATCCCGATGA TGCCAATGCG GCCTTCAGGA TTGGCATGAT GCATATCAGC AAGGGTGAAG TGGACAAGGC TGAACGCCTT GCCGATGACC TGATCAGGAA AGCGTCCAAA CGGCCGGAAG GACACCAACT TAAGGGGATT GCTCTCTATA CCAGGAAAAA CTACGACCAG GCAATCACCG AACTTCAAGG TGTCGTGAAA AACTATCAAA ATCCCGGTGC CTACTACTAT CTGGGCTTGA GTTACTATCA ACGCAACGAT CTTGAGAGCG CCCTGAGCCA GTTCCGCAAA GTGATTGACC TCAACCCGAA GCATCTGCAG GCCCGGCTCA TGTCCGCGAC GGTTCTGCTT CAGCAGAAAC GCACCGATGA TGCCATCACC GAGGCCAAAC GAGTCATTGA GCTGGACGAC AAGAATGCCT TTGCCCATAA CGTCCTCGGC AGCGCCTACA TTGCCAAGGG GATGACTGAC GAGGCAGTCA GGGAGCTCAA CCGCGCGATC GAGCTTGAGC CGAAGCTGGT CGGTGCCCAC ATGAAGAAGG GGATCGTTAA CCTGGCGCGC GGAAAGAGGG CGGAAGGCGA GGAAGAGCTT GAAACGGCAG TGAAGGTCGC TCCCGATGTT CTCAATAGCC GGTATCTGCT GGCAAGCTAC TACATGCGCC TGAACAAACA GGCCCGTGCC ATCGAGGTGC TCAGGGCGGG GCTGAGGGGA GGCAAGCAGG ATGCTCCTCT CTACAATACA ATGGCCGCTG CCGCCTTCGC CGACAAGCGG GAGAAGGATG GCATTTTGTA TCTTCAAAAG GCGCGGGAAG TCGATCAGGC ATATCTCCCG GCTTCCTTCA ACCTGGCATC CTATTACATC AGCAAAAAGG ACAATGGGCG CGCAGCCGGC TTGTTCAACG AGATCCTGAA GCGTGATCCG GCCAACCTCA AGGCGTTGAT CGGCCTCGGT GCCATCAGCG AATCGGAGGG GAAGGCGCAG GAGGCAGTCG CCTACTATAC CAAGGCCAAG GCAACGAAGG ACCCCGTCGG CTATCTCTCT CTTGCATCCT ACTACATCAG GAAAAAGGAG GCGCCCAAAG CCCTTGCTTT AGCAAACGAG GGGTTGAAGG GGGCTCCGAA CAATGCTGCG CTACTGGAGT TGAAGGGTGG ACTTCTCGCA GCCGATAAAA AGTACGCGGA GGCCCTCAAA GTCTATGATG AGCTGGAAAA GGTCGCCCCG ACCCGTGCGA TTCCGCTGAA AATCGGAGCC TTGGTGGTAA CCAAAAACAT TCCCAAGGCC GTGGAGCAGG CACAGAAGGT TGTCGCAGCC AACCCCAAGT CTGCGGCCGG GCATCTTGTT ATTGCGTCGA TCTACGAGAG CCAGAAAGAC TATGACCGTG CCATTTCGGC TGTCAGAAGC GGCATAGCAG TTGAGCCCGG CAATCTCCGG GCGTCCATGG CCTTAGGCGG GATCTACGAG AAGAAACGGG ATTTTGCCAG AGCTATTGCC GCCTATGACG ATATCCTCAG GAAGAATCCC AAGAATATTC CTGCCCTGTT TGCCAAAGGA GCGGCTTTCG ATCAATCGGG CAAGAAAAAA GAAGCTGCCG GTATCTATCG GGCCGTACTT AAAATAGCGC CTGATTTTGT ACCCGCACTT AATAACCTGG CATATCTTGC TGCCGAAGGA TATGGAAACA AGTCAGAGGC GTTGTCCCTG GCCAGTCGGG CAAACAAGCT CCAACCCAAT AACGCCGGGG TCATGGATAC CTACGGCTAT GCGCTGCTGA TTAACGGCAA GAGGACCGCA TCGGTCAGAG TGCTCGAAAA GGCAGTCTCT CTCCTTCCGA ACAACCCCGC CGTGCACTAT CACCTTGCCA TGGCTTATCG TGACATGGGT GACAGGGCAA AGGCCAGCGT CAGTGTCCAG AAATCTCTTC AATATGGGGA ATTCCCCGAA AGTGCCGCGG CAAGGAAACT TCTTGCCGAA CTGAAGCGCT AG
|
Protein sequence | MFNRRIALIC LIVATLSACG GKTKEELYAE AVKELDKGNA NGAIVLLKNA VEKDQNYFDA RYKLAKAYMT VGKFEQAEKE FQKALRQNPS NPEIRLDLAK LYNSINKPDE SIAEAKAYLS ARAGSADALE VIGTSYGQKK MFDEAEKYLK ESLQAEPARA SAMLQLAKVY LATKREQEGM GLLNEIVRKD PKNTKAYYLA AFYEGYRGNS EKALEYYQKI MQADPDDANA AFRIGMMHIS KGEVDKAERL ADDLIRKASK RPEGHQLKGI ALYTRKNYDQ AITELQGVVK NYQNPGAYYY LGLSYYQRND LESALSQFRK VIDLNPKHLQ ARLMSATVLL QQKRTDDAIT EAKRVIELDD KNAFAHNVLG SAYIAKGMTD EAVRELNRAI ELEPKLVGAH MKKGIVNLAR GKRAEGEEEL ETAVKVAPDV LNSRYLLASY YMRLNKQARA IEVLRAGLRG GKQDAPLYNT MAAAAFADKR EKDGILYLQK AREVDQAYLP ASFNLASYYI SKKDNGRAAG LFNEILKRDP ANLKALIGLG AISESEGKAQ EAVAYYTKAK ATKDPVGYLS LASYYIRKKE APKALALANE GLKGAPNNAA LLELKGGLLA ADKKYAEALK VYDELEKVAP TRAIPLKIGA LVVTKNIPKA VEQAQKVVAA NPKSAAGHLV IASIYESQKD YDRAISAVRS GIAVEPGNLR ASMALGGIYE KKRDFARAIA AYDDILRKNP KNIPALFAKG AAFDQSGKKK EAAGIYRAVL KIAPDFVPAL NNLAYLAAEG YGNKSEALSL ASRANKLQPN NAGVMDTYGY ALLINGKRTA SVRVLEKAVS LLPNNPAVHY HLAMAYRDMG DRAKASVSVQ KSLQYGEFPE SAAARKLLAE LKR
|
| |