Gene Information |
Locus tag | GM21_3833 |
Symbol | |
ID | 8139207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4418001 |
End bp | 4420283 |
Gene Length | 2283 bp |
Protein Length | 760 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644871450 |
Product | Tetratricopeptide TPR_2 repeat protein |
Protein accession | YP_003023608 |
Protein GI | 253702419 |
COG category | [R] General function prediction only |
COG ID | [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
|
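The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that Start bp and End bp are 1-based inclusive coordinates, and that the CDS ends in a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 4418001
end_bp = 4420283

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length (2283 bp) and Protein Length (760 aa).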
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.00358812 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCTGCTA GCTCCCGCGC GGATAGCCCG GCCGAGACGT TCGCCGCGGC CCTCGATCTG CAGAAAAGCG GCCACAGGGG CGAGGCCGAG CGGCTCTACC GCGCCCTGGC GGCATCGGGG GGGGAGTTGG CCGCCGACGC CTGCATCAAT CTGGGGGCGC TCCTGGACGA GAGCGGGCGC GCCGAGGAGG CGCTGGAAAA GTACCGCGAG GCGCTCGCCC TGCGGGAGGG GGACCCCCTC GCCCTAAACA ACGCCGGTTC CACGCTGTTC AAGCTGGGGC GCTTCACGGA GGCGGCGCAA CTATTCCGCC ATGCCCTGGA GCGGGCCCCG GATTCCCTGG AGGCGCAGGT GGCGCTCGGC GCGGCGCTGC AAAGGGACGG GGATCTCCCC GCGGCGCTCG CCGTCTTCCG CGATCTGGTG GCGCGGCGCC CGGATTGCGC CGAAGCGCAC TGGAACCTGG CGCTGGCCCT CCTCCTGGCG GGTGAGTTCC GCGAGGGGTG GCAGGAGTAC CAGTGGCGCT GGCGCAGGGA CTCCTTCACT TCGCCCCGGC GCGAGCTTGC GGCGCCCGCC TGGGACGGCA CCCCTCTTCA AGGGCGCCGC ATCCTGGTGC ACGGCGAACA GGGGCTGGGC GACACCATCC AGTTCGCCCG CTACCTCCCC ATGGTCGCCG CCGCGGGAGG GGTGGTGGTG GCGGAATGCC AGTCCCCCTC CCTGGTGCCG CTCTTGCGCT CCATCCCCGG CGTCGCCGAG ACCTGCGTCA TGGGGGAAAC GCTTCCCCCC TACGACCTCG AGGTCGCGCT CCTGTCGCTC CCCCACCTGT TCGGCACCAC CCTGGAGAAC GTTCCAAGCG GGGTCCCCTA CCTGGCGCCC CCACAGGACC GGATCGCCCC CTGGCGGGAG AAGGTGGCGG CGGACCTGGG GTTCAAGGTG GGGCTGGTCT GGGCCGGGAA GCCGGTTCCG GACCCATTTC GCTCCTGCAC GCTCGCGGCG CTCTCGCCTC TCTTCGACAT CCCCGGGGTG AGCTTCTATT CGCTCCAGGT GGGTGAGGAG GCGCAACAGG CAAAGGAATT TCCCTCCCTC ATCGATTTCA CCCCCGGCAT CGCGGACTTC GGCGACACGG CCGCTCTCAT CGCGCAGCTC GACCTGGTCC TCTCCATCGA CACCTCCGTG GCCCACCTGG CAGGCGCACT GGCGAAGCCG GTCTGGCTGC TGCTCCCCAA GGCGGGCGAC TACCGCTGGC TCACCGAGCG CGAAGATTCC CCCTGGTACC CGACCATGCG CCTTTTCCGG CAGAAGCTGC AGGGAGAGTG GGGGGAGGTG GTCGAACGCG TGAAGGAGGA ACTGGAGCCG TCGGCCTGGG GCTTTTTGGA AAAAGCTGCC GCGGCGCAGC CGTTCAACGG CCGCAGACAC TACCTCTGCG GGCTCTTCCT CTCCTTCGAA AAGAGGGAGC GCGAGGCGAC GGTAAGGTAC AGCAAGGCGG CGCAGTTGAT GCCCGGAAGC TGGGAGCCGC ACTACGCGCT CGCCTGCTCG CTGCAGCAGC TTACGCGACT TGCCGAGGCG AAGGAGAGCC TTGTGGCGGC GCTCGTCTTG GAGCCGCGCC TTCCCCTCTT GCACGAGGCT TTCGGCATCC TGTGCCAGAT GCAGGACGAC CCCGAGGGGG CGGCGCGCGC CTACCGGGAG GCGCTGGCGC TGGACCCGGA CGCGGTCAAG GCACGCTACA ACCTGGCCAC GCTCTGCAAG GAGAAAGGGC TCGCAGCCGA GGCTCTGCAA GGTTTCCGCG AGGTGGTGCG GCGCGAGCCG 
GAGCATGCCG ACGCGCATTG GAACCTGGCC GTGATGCTCC TCATGACCGG GGAGTTCGCC GAAGGTTGGC GGGAGTTTCC CTGGCGCTTC AAAAAGAGCC TCTCTCCCCC GGTGCGCCGC TGGGAGGAGC TGCCGCGCTG GGGACGGCTC CCCGCTTGCC GGTGCGACCG TCCTGCTCTA CGGGGAGCAG GGGGCCGGCG ACACGCTGCA GTTCGTGCGC TACGCCCCGC TGGTGGCAAA GCGCGGCGGA CGCGTGCTCA TCGAGGTGCA GTCGCGGGGG CTCGTCGAGC TGGTGGCGAC CGTCGCTGGC GTCAGCGGCG TCTTCGCCTG CGGCGACCCC CTCCCCGCGT TCGAGTGGCA GGCCTCGCTG ATGGATCTTC CCGGCATCTT CGGCACCGAG CCCGGCACCA TCCCGGCCGC CATCCCCTAT CTCGTGGTCG ACCCCGGGCG CCGCGACTCG CTGCGCCGTC TCTTCGAGGC TGA
|
Protein sequence | MPASSRADSP AETFAAALDL QKSGHRGEAE RLYRALAASG GELAADACIN LGALLDESGR AEEALEKYRE ALALREGDPL ALNNAGSTLF KLGRFTEAAQ LFRHALERAP DSLEAQVALG AALQRDGDLP AALAVFRDLV ARRPDCAEAH WNLALALLLA GEFREGWQEY QWRWRRDSFT SPRRELAAPA WDGTPLQGRR ILVHGEQGLG DTIQFARYLP MVAAAGGVVV AECQSPSLVP LLRSIPGVAE TCVMGETLPP YDLEVALLSL PHLFGTTLEN VPSGVPYLAP PQDRIAPWRE KVAADLGFKV GLVWAGKPVP DPFRSCTLAA LSPLFDIPGV SFYSLQVGEE AQQAKEFPSL IDFTPGIADF GDTAALIAQL DLVLSIDTSV AHLAGALAKP VWLLLPKAGD YRWLTEREDS PWYPTMRLFR QKLQGEWGEV VERVKEELEP SAWGFLEKAA AAQPFNGRRH YLCGLFLSFE KREREATVRY SKAAQLMPGS WEPHYALACS LQQLTRLAEA KESLVAALVL EPRLPLLHEA FGILCQMQDD PEGAARAYRE ALALDPDAVK ARYNLATLCK EKGLAAEALQ GFREVVRREP EHADAHWNLA VMLLMTGEFA EGWREFPWRF KKSLSPPVRR WEELPRWGRL PACRCDRPAL RGAGGRRHAA VRALRPAGGK ARRTRAHRGA VAGARRAGGD RRWRQRRLRL RRPPPRVRVA GLADGSSRHL RHRARHHPGR HPLSRGRPRA PRLAAPSLRG
|
| |
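The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a field might be processed, the code below strips the spacing from the first three blocks of the gene sequence, translates them, and computes the GC fraction of that fragment. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares for sense codons; alternative start codons are not handled here. The GC value applies to the 30-base fragment only, not to the whole gene.

```python
from itertools import product

# Codon -> amino acid map in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments as the standard code;
# it differs only in the set of permitted start codons (ignored here).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the Gene sequence field, copied from the record.
fragment = clean("ATGCCTGCTA GCTCCCGCGC GGATAGCCCG")

peptide = translate(fragment)
fragment_gc = gc_fraction(fragment)
print(peptide, fragment_gc)
```

The translated fragment can be compared against the first block of the Protein sequence field (MPASSRADSP).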