Gene Information |
Locus tag | GM21_0943 |
Symbol | |
ID | 8136264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1117060 |
End bp | 1119021 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644868558 |
Product | Tetratricopeptide TPR_2 repeat protein |
Protein accession | YP_003020767 |
Protein GI | 253699578 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
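The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names are hypothetical, chosen for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 1117060
END_BP = 1119021
GENE_LENGTH_BP = 1962
PROTEIN_LENGTH_AA = 653


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count of an unspliced CDS, minus one for the stop codon (assumed included)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1962
    print(expected_protein_length(GENE_LENGTH_BP))  # 653
```

Under these assumptions both results agree with the Gene Length (1962 bp) and Protein Length (653 aa) rows.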
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.0000000395442 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAAAA GACGCAAGTT AGCAAAGCTT GAGCCGCAAG CCGGGCCGAA GGCACCGCCC GCGGCCGAGG AGGAGTTGAA CCTCTGGCGC GATCCCCTGG TTCATGTCCT GCTGGTGCTG GCGCTGGGCT TCGCGGTCTA CTCGAACATC ATAGCCGCGC CGTTTGTCTT CGACGACCTC CCTTGCCTGG TCAACAACCC GATCATCAAG GACTTCTCCT TCTTCGCCGA CCCGAAGCAG GTGTTCGGGC TCCCGATCAA CCCGGACCTG AAGAACAACT TCATCCTGCG CCCGGTAGCC TACTTCACCT TCGCGCTGAA CCACGCCTTG CACGGGCTGG ACGTGCGGGG TTACCACATC GTCAACCTCC TGCTGCACAT GGCCGACGCC CTGCTGGTGT ACCTCGTTTC GTGGCTCACC CTGAGGACGC CGGCGCTGCA ACCGGAGCAG GGTAAAGAGG CGGATCCCCC GACGGAGAAA TACTTCTATC TCCCGTTTCT GGCGAGCCTT TTGTTCGTCT GCCACCCGCT GCAGACGCAG TCGGTCACCT ACGTGGTGCA GCGCTTCGTA CCCCTGGTCG CCTTTTTCTA CCTGGGGTCG CTGGCGCTGT ATGCCGCGGC GAGGCTCTCG GAGACAAAGG GGATTCGGGT CGGCTGCTAC CTCGGCTCGC TTTTCGCCTG CGTCCTCGCC ATGAAGAGCA AGGAAAACGC CTTCACCCTC CCGGCGGCGA TCGTGCTGTA CGAGTTCGTC TTCTTCCGGG GCGCGGTCAC CGCCGCCCGG CTCGCGCGGC TGGTCCCGTT CCTCTTCACC ATGGCGATCA TCCCAGTCAA GCTGATGTCT CTCTCGGCCA TGGCTGCCAC GGGGGGCAAG GTGGCCGGTG CTGTCAACCT AATCAATTTC AAACAGACCT CCCCCTGGGA ATACCTGATG ACGCAGTTCG GGGTGATAAC GACCTACTTG CGGCTGCTCA TCCTGCCGGT CAACCAGAAC TTGGATTATC AGTACCCGCT GCAGAAGGTT TTCCTCGCCC CGGCCGTAGT CCTGCCGCTG CTTTTGCTGC TGGCGCTGGC AGCCGGGGGT ATTTATCTTC TGGCGACCTC GCGCAGGGGA GACGACCGGG CCGGCATGCG CGCGCTGGCC GGCTTCGGCA TCTGGTGGTT TTTCATCACC CTCAGCGTCG AATCGAGCGT GGTGCCGATT GACGACGTCA TCTTCGAGCA TCGGGCCTAC CTGCCGTCGG TAGGATTCTT CATCGCCCTG CTCGCCGCGG CGTTCTCCCT CCCCCCCCGC TTCGGCGGGA CCCCGCTTTG CACCTCGCGC CCAGCGGTCG CCGTTTTCGC CTTCCTGGTA CTCGCCAGTT CAGTCGCCTG CTACCTGCGA AACGAGGTGT GGACGACGCC GGTGGCCCTG TGGCGGGACA CGGTGCAGAA GAGTCCGGGA AAGGGGAGGG CGCACTTCTC CCTGGGGTTC GCGCTGGCCA ATACCCTGCC TCCCTGGCAC ACCGACGACA TCAACGTGAT GCTCCAGCCC ATGGACGCCG CCCAGAACCA GGTGCTGGCG GAAGCGGTCC GGGAGTTCCG CGCCTCGACG AAGCTCGAAC CCGAATCGGC GGCCGGATAT TCATTCCTGG GGGCGGCGCT GATGGTGCAA CGGAAGTTCG ACGAAGCGGC GGCCGCATTG GCGACCGCCG CCGCGCTCGA TCCGAAGGAC GCAAGGACCC GCGCGTTTCT CGGGCAGTTG AGCGAGGCCC GGGGGGATCT TGCCGCCGCT CGCCTCCAGT ACCGGCAGGC CCTTTCTCTC 
AGCCCCCGGG AGCCCTTCCC GCATCTGTTC CTGGCGCTTC TCTCCCTGCG AGAGGGGAGG CACGCCGAGG CACTTAAGGA GTACGAGATC GCCCACCGGC TCGCTCCCCG CCCCGACCTG GAGCCGAAGA TGGCGCAGTT GAGATTCATG GTGGGACGAT GA
|
Protein sequence | MKKRRKLAKL EPQAGPKAPP AAEEELNLWR DPLVHVLLVL ALGFAVYSNI IAAPFVFDDL PCLVNNPIIK DFSFFADPKQ VFGLPINPDL KNNFILRPVA YFTFALNHAL HGLDVRGYHI VNLLLHMADA LLVYLVSWLT LRTPALQPEQ GKEADPPTEK YFYLPFLASL LFVCHPLQTQ SVTYVVQRFV PLVAFFYLGS LALYAAARLS ETKGIRVGCY LGSLFACVLA MKSKENAFTL PAAIVLYEFV FFRGAVTAAR LARLVPFLFT MAIIPVKLMS LSAMAATGGK VAGAVNLINF KQTSPWEYLM TQFGVITTYL RLLILPVNQN LDYQYPLQKV FLAPAVVLPL LLLLALAAGG IYLLATSRRG DDRAGMRALA GFGIWWFFIT LSVESSVVPI DDVIFEHRAY LPSVGFFIAL LAAAFSLPPR FGGTPLCTSR PAVAVFAFLV LASSVACYLR NEVWTTPVAL WRDTVQKSPG KGRAHFSLGF ALANTLPPWH TDDINVMLQP MDAAQNQVLA EAVREFRAST KLEPESAAGY SFLGAALMVQ RKFDEAAAAL ATAAALDPKD ARTRAFLGQL SEARGDLAAA RLQYRQALSL SPREPFPHLF LALLSLREGR HAEALKEYEI AHRLAPRPDL EPKMAQLRFM VGR
|
| |
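The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how they might be cleaned, translated, and checked follows. It assumes the codon assignments of translation table 11, which match the standard genetic code; table 11 differs only in its permitted start codons, which does not matter here because this gene begins with ATG. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database API.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 21 bases of the gene sequence shown above.
    print(translate("ATGAAAAAAA GACGCAAGTT A"))  # MKKRRKL
```

The first 21 bases translate to MKKRRKL, which matches the first seven residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.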