Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3825 |
Symbol | |
ID | 8139199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 4407566 |
End bp | 4409335 |
Gene Length | 1770 bp |
Protein Length | 589 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644871442 |
Product | Tetratricopeptide TPR_2 repeat protein |
Protein accession | YP_003023600 |
Protein GI | 253702411 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 0.0624707 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGGTACC AGGAGAGCGC GGAGAGCCAG GCGCGGCTTA AGTACGTGGA CCTGCTGCTC AAGAAGGGGA TGCAGGAGGA GGCCCGGACC CAGTTGCAGC AGCTGGTGCA GGAAGACCCC GGCTGCGCGC GCGGCTGGTA CCTCCTCGCC GTGCTGGTGG GGGAGCAGGG ACACCCGGAC CAGGCGGCGA AGCTCCTGAG ACAGGCGCTT AGGGCCGAGC CGGAGAACGT CAAGGCGCTG AACGCGCTCG GGGTGGCCCT GCAGCAGATG GGGGAGCGGG ACCAGGCCGC GGCATGCTAC GGCGAGGCGC TCCGCATCGA TCCCCGGTTC CAGGAGGCGC GGGTGAACCT CGCCCTCTTC CTCAAGGTGG GGATGAGGCT TGCCGAGGCC GAGGCGCTCC TCTCCCGGGG GATCGCGCTC GAGCCCGCAT CGGTGCGGCT TCGCTACAAC TACGCCAACG TGCTCCATTA CCAGGGGAGA AGCCTGGAGG CGGCCGGCGC CTACCGGGAG GTGCTCCGCC TGGACCCGCA GCACCTGGAC GCCAGGCAGA ACCTCCTTTT CGCGCTGCAC TACTCTCCGC AGTTCTCCGA CCGCCGGATC TTCGCCGAAC ACCTGCGCGC CGCGAGAAGC GCCCCCTTCC GCCTCCCCCC CTCCCCTTCC GTCCCGCGCC GAGGCGGGCG CATCAGGATC GGCTACCTCT CCCCCGACTT TCGCAGCCAC GCCGTCGCCT CGTTCATCGA GCCGGTGCTC AAGGCACACG ACCGGGAGCG CTTCGAGATC TTCTGCTACG CGAACCTCCC CCGCCCCGAC CGGGTCACCG AGAGGGTGAA GGCTTTGAGC GAGCACTGGC GCGACCTCTA CAACATCCCG GACCAGATCG CCGCGCTGAT GATCGCCGCC GACGCCCTCG ACGTCCTGAT CGACCTGGCC GGCCACACCT CCGGGAACCG GCTCCCGCTT TTCGCGCGCA GGCCCGCTCC CCTGCAGATC ACCTGGATCG GCTACCCCGA CACCACCGGG CTCAAGCAGA TGGATTACCG GATCACCGAC CGCCATGCCG ACCCGCCCGG GAAAAGCGAG CGCTACCACA CCGAGACGCT GCTCAGGCTC CCGCGCAGCT TCAGCTGCTT TCTCCCGCCG CAGGAAGCCC CCGAGGTGGC ACCTGTCCCG TGCCTTGCGA CCGGCGCGGT CACCTTCGGC TCGTTCAACA ACCTGGCCAA GGTCACCCCC GAGACGATCG CCCTCTGGTG CCGGGTGCTC GATGCCGTCC CCGGCTCGCG CCTGCTGTTG AAGGGGAGGC CCTTCGCCGA CAGCGGGGTA CGGGAGAGGA TCGCATCCCT GTTCGCCAGA GGAGGGATCG CGGGGGAGCG GGTCGAGCTA CACCCGGGCG AGCCGGAGAA TTCGGCGCAC CTGGCGCAGT ACGGGCGGGT CGACATCGCC CTCGACACCT TCCCCTACAA CGGCACCACC ACCACCTGCG AGGCGCTCTG GATGGGGGTC CCGGTGGTGA CCCTCGCGGG TACGAGGCAC GCGGCGCGGA CCGGCGCGAG CATCCTTACG AACTGCGGGC TCGATGAGCT GGTGGCCGAG GACGAGGGGG AATACCTGGA GATCGCCCGG CGGCTGGCGG CGGATCGGGG GAGGCTTTCG GAGTTCAGGA AGGGGGCGCG GGAAAGGCTC GCGGCGTCGC CGCTACTGGA CGCGGCGGGG GTGACGCGGG AGTTGGAAGC GGCCCTGGAG GGGGTCCTCA AGGAGCGCGG GGTGCGTTAG
|
Protein sequence | MGYQESAESQ ARLKYVDLLL KKGMQEEART QLQQLVQEDP GCARGWYLLA VLVGEQGHPD QAAKLLRQAL RAEPENVKAL NALGVALQQM GERDQAAACY GEALRIDPRF QEARVNLALF LKVGMRLAEA EALLSRGIAL EPASVRLRYN YANVLHYQGR SLEAAGAYRE VLRLDPQHLD ARQNLLFALH YSPQFSDRRI FAEHLRAARS APFRLPPSPS VPRRGGRIRI GYLSPDFRSH AVASFIEPVL KAHDRERFEI FCYANLPRPD RVTERVKALS EHWRDLYNIP DQIAALMIAA DALDVLIDLA GHTSGNRLPL FARRPAPLQI TWIGYPDTTG LKQMDYRITD RHADPPGKSE RYHTETLLRL PRSFSCFLPP QEAPEVAPVP CLATGAVTFG SFNNLAKVTP ETIALWCRVL DAVPGSRLLL KGRPFADSGV RERIASLFAR GGIAGERVEL HPGEPENSAH LAQYGRVDIA LDTFPYNGTT TTCEALWMGV PVVTLAGTRH AARTGASILT NCGLDELVAE DEGEYLEIAR RLAADRGRLS EFRKGARERL AASPLLDAAG VTRELEAALE GVLKERGVR
|
| |