Gene Information |
Locus tag | GM21_0439 |
Symbol | |
ID | 8135748 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 530659 |
End bp | 533931 |
Gene Length | 3273 bp |
Protein Length | 1090 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644868057 |
Product | Tetratricopeptide domain protein |
Protein accession | YP_003020277 |
Protein GI | 253699088 |
COG category | |
COG ID | |
TIGRFAM ID | |
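The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon. The record does not state either convention, but both agree with the values listed. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 530659
END_BP = 533931
GENE_LENGTH_BP = 3273
PROTEIN_LENGTH_AA = 1090


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks hold: 533931 - 530659 + 1 = 3273 bp, and 3273 / 3 - 1 = 1090 aa.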
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 109 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGGA TCCGATTGCT ACTCCTGGCA GTTCCGGCGC TTCTGGTGGC GTGCCAGTCG GGCGGCGGCA GGGAAACGAT CGCACAGCTG CGCAACGTGC GGATCGAAGT CAAGGAAGAG CCGATCGAAG GCGGCCTGGA GAAGGCGATG GAAGGTTACC AGCGCTTTCT GGAGCAAACG CCCGAATCGG GGCTTACCCC GGCTGCCATC CGCCGCCTCG CCGACTTGAA GGTCGAAAAG GAATACGGCT ACCTGGCCGC CGCGACCGCG CCTGCCGCAG GGGTCGCGGC AGCGCCGCTC GGCGCACCGG AACCGGGAGA GGCCCCGTCG GCCATGGCCC CCCTCGCGTC GGACGCGAGC CAACCGGGCC TAGCCGAGTC GGATGCGGAG TTCGAGAAGA GGGCGCTCCA GGGGCCGCAG CAGGCGGGCG GGGAGCGCGA AGCGGGCGGA GAAGGACTGG AGCGGGCCGG GACCCAGGAG GCGATCGCGC TCTACCAGAA GCTTTTGGAC AAGTACCCCC ACTACGAGGG GAACGACCAG GTCCTGTACC AGATGTCCCG CTCCTACGAG GAACTGGGGC AGACCGAAGA TGCCATGGCG GTCATGCAGC GCATGGTGAA CGACTTCCCA CGCTCGCGCT ACATCCACGA GGTGCAGTTC CGCCGGGCCG AGTACTTCTT CACCCACAGG CAATACCTCG AGGCGGAACC GGTCTACAAG GGGTTGGTGG AGATCGGTCC CGAGAGCTCA TATTACGAGC TGGCCCTGTA CAAGCTGGGT TGGAGCTTCT ACAAGCAGGA GCTCTACGAT GAGGGGCTGC ACCGTTTCAT CGCGCTTTTG GACCACAAGG TGAGCACCGG ATACGACTTC GCCCAGACAA CGGACGATCT GGAGCGAAAG CGCGTCGACG ACACTTTCCG GGTTCTCAGC CAGAGCTTCT CCTACCTTCA TGGCGCCGCT TCCGCCGTCG AGTACTTCGA GAAGAACGGC AAGCGCGCCT ATGAGGACCG CGTCTACGGC AACCTCGGCG AGTTCTACTA CGAAAAGCGC CGCTACAGCG ACGCCGCGGC GTCCTACAAC GCGTTCGTCT CCCGCAATCC GTTCCACCGC GCCTCCCCGC AGTTCCAGAT GCGCGTGATC GAGATCCACA TCGCGGGCGG TTTCCCCACC TTGGTGATCG AGGCGAAGAA GGAATTCGCC AAGACCTACG GGCTGAAGGC CGAGTACTGG AAACATTTCC AGCCGGGCGA GCGTCCCGAG GTCATAGCTT TCCTGAAGAC CAACGTTACC GACCTGGCCC ACCACTACCA CGCCCTGTAC CAGGACCCGG CGCACGCCAA GGAAAGGGAG GAGAGCTTCC AGCAAGCCCT GCACTGGTAC GAGGAGTTCC TGGTCTCCTT CCCGAAGGAA GCGGAATCGC CAGCCATCAA CTACCAGATG GCGGACCTGC TCATGGAAAA CCGCTCCTTC GCCAAGGCGG CGCAGGAATA CGAAAGGACC GCCTACGACT ACCCGCGTTA CGAGAAGTCG TCTGCAGCCG GATACGCGGC CGTGTTCGCC TACCGGGAGC AGCTGAAGAA CGCCCAGGCA GAGGAGAAAG AGAAGGTCAA GCGGGAGGGG GTACGCAGCT CGCTCAGGTT CGCCGAGACC TTCCCGGAAC ACGAGAAGGC GGCGATCGTC ATGGGGGCGG CCGCGGACGA CCTCTACGAG CTGAAGGAGT ACCAGCAGGC GTTGAGCGTC GCGCGCAAGC TGATCGCGAC CTTCCCTGGC GCGGGAAGCG AGGTGCTCAA GTCGGCCTGG 
GTAGTGGCCG CCCACTCCTG TTACGAACTT CGGAACTACG CCGAAGCCGA GGCCGCCTAC GTCCAGGTTC TGGCGCTGGT CCCGGCCGAG GACAAGAGCA GGGAAGGCTT CAACGACAAC CTCGCCGCCT CGATCTACAA GCAGGGCGAA CAGGCTAACG CCGCCAAGGA GTACCGGCTC GCGGCGGACC ACTTCCTGCG CATCGGCCGC ATGGCCGCCA CCTCGAAGAT CCGGGTCAAC GCCGAATTCG ACGCCGCCGT GGCGCTGATC CAGCTCAAGG AGTGGAAAAC CGCGGCCACG GTCCTCACCG GGTTCCGGGG ACTCTTCCCC GGCCATGAAA TGCAGCCGGA AGTCACCAGG AAGCTCGCCT ACGTCTACAA AGAGGACGGG CAGCTGGCAC TTGCGGCCGG TGAATATGAG CGCCTGGAGA CAGAATTCAA GGACGATGAG ATCAGGAGGG AGGCACTGCT ATTGGCGGCG GACCTGCACC AGCAGGCCGG GAACAGGAAG CAGGCTCTCG CGGTGTACCG CCGCTATGTC GGGTACTTCC CGCAACCGGT GGAGGTCAAC CTGGAAATGC GCAACAAGGT CTGTGAGATC CTGAAGCTGG AGGAAGACCG GAAAGGGTAC CTGGACGAGC TCCGGGAGAT GGTCGCCATC GATGCGGCGG CAGGTCCGGC GCGCACCCCT CGCACCCGGT ACCTGGCCGG GAAGGGGGCC CTGGTGCTGG CCGAGCAGAG CTATGAGCGC TTCACCGAGG TGCGGCTGGT GAAACCGTTC GAGGCGAACC TGCGCAAGAA GAAAGAGCTG ATGAAGGCGG CCACCCAGTC GTTCAACAAG CTGCCGGAGT ACGAGGTAGG CGAGGTCACC GCCGCGGCGA CCTTCTACCT GGCGGAGATC TACGGGCACT TCAGCAAGGC GCTCACCGCG TCCGAGCGGC CGGACGACCT GGACGCCCAG GAGCTGCAAG AGTACGAAAT GGCCATCGAG GAGCAGGCGT ATCCCTTCGA GGAAAAGGCC ATCACCGTTC ACGAGAAGAA CATGGAGCTG ATATCGGTCG GCATCTACAA CGGCTGGATC GACAGGAGTC TCGGGAAACT GGCCAAGCTG CTGCCGGTCC GCTACGACAA GCCGGAGGTC CCCAGCGGCA TGATCGCTTC GCTGGAGAGC TTTGCCTACG AGATCGAGAA GCCCGCGGCG CCGGCGGCCG CGGAGGTGAA CCCGGTCATG AGCGACGCCG TAGCCCCGGC GGAGCCGGAG CGGGCCGATA CCGCCGCCTC GACTGCAGCA TCGGCGGCAG CGGCTGAAGG CTCCGGCGGG AAGATGGTCG ACGGCCGTGA CGGTGCGGGC GCCGCGCCGG TCGCCGCGCC GGAAAAAAGC AAGGCAGCTA CGTCGAAAGC CGTCCCGGCC AAGGCGAAAA AAGCTAAGCA GGCCGCCGCG ACCAAGCGGC GCGTAAAGGG AGGTAAAAAA TGA
Protein sequence | MKRIRLLLLA VPALLVACQS GGGRETIAQL RNVRIEVKEE PIEGGLEKAM EGYQRFLEQT PESGLTPAAI RRLADLKVEK EYGYLAAATA PAAGVAAAPL GAPEPGEAPS AMAPLASDAS QPGLAESDAE FEKRALQGPQ QAGGEREAGG EGLERAGTQE AIALYQKLLD KYPHYEGNDQ VLYQMSRSYE ELGQTEDAMA VMQRMVNDFP RSRYIHEVQF RRAEYFFTHR QYLEAEPVYK GLVEIGPESS YYELALYKLG WSFYKQELYD EGLHRFIALL DHKVSTGYDF AQTTDDLERK RVDDTFRVLS QSFSYLHGAA SAVEYFEKNG KRAYEDRVYG NLGEFYYEKR RYSDAAASYN AFVSRNPFHR ASPQFQMRVI EIHIAGGFPT LVIEAKKEFA KTYGLKAEYW KHFQPGERPE VIAFLKTNVT DLAHHYHALY QDPAHAKERE ESFQQALHWY EEFLVSFPKE AESPAINYQM ADLLMENRSF AKAAQEYERT AYDYPRYEKS SAAGYAAVFA YREQLKNAQA EEKEKVKREG VRSSLRFAET FPEHEKAAIV MGAAADDLYE LKEYQQALSV ARKLIATFPG AGSEVLKSAW VVAAHSCYEL RNYAEAEAAY VQVLALVPAE DKSREGFNDN LAASIYKQGE QANAAKEYRL AADHFLRIGR MAATSKIRVN AEFDAAVALI QLKEWKTAAT VLTGFRGLFP GHEMQPEVTR KLAYVYKEDG QLALAAGEYE RLETEFKDDE IRREALLLAA DLHQQAGNRK QALAVYRRYV GYFPQPVEVN LEMRNKVCEI LKLEEDRKGY LDELREMVAI DAAAGPARTP RTRYLAGKGA LVLAEQSYER FTEVRLVKPF EANLRKKKEL MKAATQSFNK LPEYEVGEVT AAATFYLAEI YGHFSKALTA SERPDDLDAQ ELQEYEMAIE EQAYPFEEKA ITVHEKNMEL ISVGIYNGWI DRSLGKLAKL LPVRYDKPEV PSGMIASLES FAYEIEKPAA PAAAEVNPVM SDAVAPAEPE RADTAASTAA SAAAAEGSGG KMVDGRDGAG AAPVAAPEKS KAATSKAVPA KAKKAKQAAA TKRRVKGGKK
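Both sequences are displayed in space-separated blocks of ten characters, so they need to be normalized before reuse. The following is a minimal sketch of that cleanup, with a GC-fraction helper and a stop-codon check. The helper names are hypothetical, and the stop codon set (TAA, TAG, TGA) is the one used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the final three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # First two and last two blocks of the gene sequence shown above.
    head = clean_sequence("ATGAAGCGGA TCCGATTGCT")
    tail = clean_sequence("AGGTAAAAAA TGA")
    print(len(head), gc_fraction(head))
    print(ends_with_stop(tail))
```

Applied to the full gene sequence, `gc_fraction` should land close to the 64% GC content reported in the table, and `len(clean_sequence(...))` should equal the listed gene length.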