Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1408 |
Symbol | |
ID | 8136736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 1655234 |
End bp | 1656517 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644869022 |
Product | hypothetical protein |
Protein accession | YP_003021225 |
Protein GI | 253700036 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.00000000450965 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCAGGT ACAATGCGAT GCGGGAAATT CCCGCTACTG CCGGATTCAA GGAAGGGGAC GTATTTTTTC TCTGCGGCGA GCTCTTCGGC CGCGGCTATG CCAACGGCAT CGTCGACGAA GCAAGGGCCA AGGGGATGAC CATCATCGGG GCCACCGTCG GCAGGCGCGA CAACGACGGG ACCCTGCGCC CGCTCAACGC CGAGGAACTG GCAACGGCGG AAGAGAACCT GGGCGGCAAG ATCATCAATA TCCCGCTGGA AGCGGGCTTC GACATGGAGC CGGGGAGCGA CGGGATCGCA CCGGTCGACA GGTTCAAAGG TGTCAAACCG GACGACTGGG CCTCGGTGAA ACTGGACCAG GCCGAGATTG AATTCTCCAA AAAGCGCGGC ACCGAGCGTT TCTGCAAGAA CCTCGCGGCT GTGGTGGCCG AAGTGGAGAA GATGCTCCCG GCCAAGGGGA GGCTCCTGGT GGTGCACACC ATGGCAGGCG GCATCCCGAG GGCGCGCGTG TTCATGCCGA TCCTCAACAA GCTCTTCAAG GGACAGGGAG ACCGCTTCCT CTCCTCCGAA GCCTTCTGGA ACTCCGACAT GGGGCGCCTT TGCGACGCGA GCTTCAACGA AGTGACCGCC GACACCTTCC GCTACCTGAT CGACGCCACC GCTGGCCTCA GGGAGAAGCG CGAGGTAACC TACGCGGCCT ACGGCTACCA CGGCACCGGC GTACTCATCG ACGGCGTCGT CACCTGGCAG TCCTACACCC CGTACCTGCA GGGGTGGGCG AAGATCCGCC TGGAAGACAT CGCCATCGAG GCATGGGAAA AAGGGATCAA GGCAACCGTC TACAACTGCC CGGAGATCCT CACCAACTCC AGCGCCCTCT TCCTCGGGGT CGAGAATTCC CTCTATCCGC TGATGGCCTC GCTCAGGGCC GAAGGGGAGC AGAAGATCGT CAAGGAGTGC GAGGCGCTCT TGAAGGAAGG GGCCACCGTC GACACGCTGC TCGACATCGC CAACACCTAC CTCACCTCGG ATCTGGTCAC CAGCACCCGG GACTTCGACA GCTGGCCGCA GCACAACCAG CCGCAACAGC AGGAGTACAT GCTGAACGTG TCGGCGGAGC TGATCAGCCT GAACGCGGAC CCGAAGGAGA TCGTCTGCGC CGTCCTCTCC AAGGGGGTGT TCCAAGGGGT AGGGAAGCTG ATGTTCGACA GTTCCTGGGA GCCGAAGGCC CCGGTCTTCT GGTTGAACCA CGACGTGATC GCGAAGACGC TGGTGAAGAT GTAA
|
Protein sequence | MSRYNAMREI PATAGFKEGD VFFLCGELFG RGYANGIVDE ARAKGMTIIG ATVGRRDNDG TLRPLNAEEL ATAEENLGGK IINIPLEAGF DMEPGSDGIA PVDRFKGVKP DDWASVKLDQ AEIEFSKKRG TERFCKNLAA VVAEVEKMLP AKGRLLVVHT MAGGIPRARV FMPILNKLFK GQGDRFLSSE AFWNSDMGRL CDASFNEVTA DTFRYLIDAT AGLREKREVT YAAYGYHGTG VLIDGVVTWQ SYTPYLQGWA KIRLEDIAIE AWEKGIKATV YNCPEILTNS SALFLGVENS LYPLMASLRA EGEQKIVKEC EALLKEGATV DTLLDIANTY LTSDLVTSTR DFDSWPQHNQ PQQQEYMLNV SAELISLNAD PKEIVCAVLS KGVFQGVGKL MFDSSWEPKA PVFWLNHDVI AKTLVKM
|
| |