Gene Information |
Locus tag | GM21_4094 |
Symbol | |
ID | 8139468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4673781 |
End bp | 4675349 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644871709 |
Product | PBS lyase HEAT domain protein repeat-containing protein |
Protein accession | YP_003023867 |
Protein GI | 253702678 |
COG category | [C] Energy production and conversion |
COG ID | [COG1413] FOG: HEAT repeat |
TIGRFAM ID | |
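The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are illustrative and are not part of the record.

```python
# Values copied from the record above.
START_BP = 4673781
END_BP = 4675349


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1569, matching the Gene Length field
print(protein_length(length_bp))  # 522, matching the Protein Length field
```

Under these assumptions the reported 1569 bp and 522 aa agree with the coordinates.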
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 141 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCGGTTG TGAAGAAAAA ATTGCTGAAG CTGGTCCCGC CGTCGGAGAA TCCGGGGTTC CTGGTCCGGG CCCTGGTCGA GTTGCACAAG GCGGTGAAGG GGGCCGGGTT CTATCCCGTG GGGCACCCGT ACCGCATCGA AACCCTGCAG CGAGCCTACG ACTGCCTGAA GAAGCTCGTG TCGGACCGCG AACTGGTTCT CGGCGTGAAC CGGCAGGGTT TCCTCCTGGC CGGGGAACCG ATCGAGGGGA ACAACATGGT GCAGCAGCTG GCACATGAAT GTTTCATCCG CAGGATCGGC AACATCAGCT TCATGCAGGA CCTTATCCCC GGCGACCTGG GGGTCTTCGT GCAGCTGCTC AACTGCGACC CGCAAAAGGG AGCCGCCGCG GGCGGCCTCG CCAAGGAACT CGAAGACAAA GGGGTGCGCA CCGTCTGGGT CAACGAGAAG GACTTAGCTT CCATCTGGGC CAAGCGTCCC GGATACCAGG AAAGGGTGCA GGAGGGTTGG GACAACATCC CTTCTCTTGC GTTGCCTGTG ACACCGGTGA GCCGGCAGCG CGGCATCGGC GAACTGTTGG TGCTGATGGC GGAGGAACAA AACGATGCCC GCTACCAGGA GCTGGGGCGG GAGCTGGTGA CAGGGTATCA GGCCGACCCG CGGCAGGTCC CGGTTCTTAC CATCCTTGAG GAGCTGCTGC GCCAGCACCA GGAGCCTGAG CGGAGCCTTC CCCAGAGGGA GTACGCGCTC TTCACGCTGG ATCATGTGGC GGACGGCGCC GCGGACCAAC TGCTGAACGC GCTGGAGAGC CGCGAATGCG AGGAGCGGGA ATCGATCCAC CGGGTACTCA TCGCCCTCGG GGGCAAAGGG GCGTACTGGG TGATCCAGAG GATCTGCCTC GCCGAGGGGC TCTTCGAAAG GAAGTCGCTG GCCGCGGCGC TCGTGGCCAT GGGGCAGTCC GCCATAGCGC CCTTGATCGC GATGCTCAAG GACGAGCGCT GGTACGTGGT GAGGAACATG GTGGCGATCA TCGGGGAGTT GCGCTGCACC GACTGCGTCC TCGCACTGAA GCGGCCTCTG TACCACCACG ACGTCCGGGT ACGCAAGGAA GCGATCCGGG CGCTTATGAA GACGGGGGGG GAAGCGTCGG TACTTTTGCT GGTGCCGCTT CTGGACGAAG AGGACGAGGG GGTGGTACGC CACGCCATAC TCTCCCTGGG CCTGATGCGC AGCCGCGAGG CGGTGCCTGC GTTGTTGAAG CTTTTGGATC GCCGCGACAT CCTCCTGAAG GAACTCGGGG TGAAGAAGGA AGTGGTGACC GCTCTCGGGC GCATCGGCGA CCGCAGGGTC ACCCCGCAAC TGCTCAAGAT GCTCGGCACC CGCGGCTGGC CCGTGCTTGG GCGGTGGCTC GAACTGAAAG TTGCGGTGGC CTCGACGCTG GGCATGCTGG GGGACGAGAC GGCCATCGCC GCGCTCACCT CGCTAGCCCG CGGCTCCGGC GCGCTCGCCG AGGCTTGCCG CGAGGCGTTG GATGCCATCG AAAGGATCTC CGGAGGGACC CATGACTGA
|
Protein sequence | MPVVKKKLLK LVPPSENPGF LVRALVELHK AVKGAGFYPV GHPYRIETLQ RAYDCLKKLV SDRELVLGVN RQGFLLAGEP IEGNNMVQQL AHECFIRRIG NISFMQDLIP GDLGVFVQLL NCDPQKGAAA GGLAKELEDK GVRTVWVNEK DLASIWAKRP GYQERVQEGW DNIPSLALPV TPVSRQRGIG ELLVLMAEEQ NDARYQELGR ELVTGYQADP RQVPVLTILE ELLRQHQEPE RSLPQREYAL FTLDHVADGA ADQLLNALES RECEERESIH RVLIALGGKG AYWVIQRICL AEGLFERKSL AAALVAMGQS AIAPLIAMLK DERWYVVRNM VAIIGELRCT DCVLALKRPL YHHDVRVRKE AIRALMKTGG EASVLLLVPL LDEEDEGVVR HAILSLGLMR SREAVPALLK LLDRRDILLK ELGVKKEVVT ALGRIGDRRV TPQLLKMLGT RGWPVLGRWL ELKVAVASTL GMLGDETAIA ALTSLARGSG ALAEACREAL DAIERISGGT HD
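The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below shows one way to do that and to compute GC content for comparison with the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the short input string is a toy example rather than data from this record.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace; uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) DNA sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input in the same blocked layout (not taken from the record).
example = "ATGC GGCC"
print(clean_sequence(example))  # ATGCGGCC
print(gc_percent(example))      # 75.0
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the rounded percentage reported in the Gene Information table.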
|
| |