Gene Information |
Locus tag | GM21_1246 |
Symbol | |
ID | 8136571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1455322 |
End bp | 1456671 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 644868860 |
Product | hypothetical protein |
Protein accession | YP_003021065 |
Protein GI | 253699876 |
COG category | |
COG ID | |
TIGRFAM ID | |
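The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both assumptions are consistent with the listed values but are not stated in the record. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1455322
end_bp = 1456671

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1350 449
```

Under these assumptions the results agree with the listed Gene Length (1350 bp) and Protein Length (449 aa).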
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.0000000000280967 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTACTG CAACTGCAAT CCGTCCCAAA GACAGGGATG CTGTCATTCA ATCACTCCGC GCCGGAGTTG TTCCCCGCAT AGGCCAACAT CTTATCCAAG TTGGTCGAGT CAACGAAATC GCGGCTCTTA TAAAGGATAT CGACCGCATA ACCGATAATG GCTCCAGCAT CCGTTTTGTC ATAGGCGAAT ACGGTTCCGG TAAAACCTTC TTTCTGAATC TGGTCAGGGC TATTGCCCTT GAGAAGCGAC TTGTCACCGC CCATGCTGAC CTGAACCCGG ACAGGCGGCT TCATGCAACA GGTGGACAGT CCCGCTCTCT TTACGCAGAG ATGATGCGAA ATATCTCAAC TCGCTCCAAA CCCGACGGTG GGGCGATGCA GAGCATTGTG GAACGCTTTG TGACCGAAGC ACTCAAGGAG TCCAAGTCGT CTGGGAAGAA TCCGGAAGCT GTCATTCACG AGAACCTGGA CATCCTCTCG GAGATGGTCG GCGGCTACGA TTTTGCCCAT GTCATCGCTG CATACTGGAA AGGCCATGAC ACCGGTAATG AACAGTTGAA GGCTGATGCA ATCCGCTGGC TCCGGGGCGA ATTTACGACT AAAACTGACG CCAGAAATGC CCTCGGAGTG AGAACCATAA TTGATGACGG CTCCTTCTAT GACCATCTAA AGCTGATGTC ACGTTTCGTG AGACTCGCGG GATTCGAGGG GATGTTGATT TGCCTGGATG AGCTGGTGAA CCTTTATAAG CTATCGAATG TTCAGGCCCG CAACTCCAAC TATGAACAAA TTCTACGTAT CCTCAACGAT TCATTACAGG GCACCGCTGT TGGTCTTGGC TTTATCATGG GAGGCACGCC CGACTTTCTC ATGGACACAC GCCGGGGTCT CTACAGCTAT TCCGCTTTAC AATCACGCCT TGCAGAAAAC ACCTTCGCTG TGAAGGGACT AGTTGACTAC AGCGGGCCCG TTCTGCGGCT CAGCAACCTC ATCCCGGAAG ATTTTTATAT CCTGTTGGGC AAAATACGTC ACGTCTTTGC CTATGGTCAT CCCACTGACT ATCTCCTGCC CGACGAAGCC CTTAAAGCTT TCATGGACCA CTGCTCCAAA AAAATAGGTG ACGCCTATTT TCGGACTCCC CGCAATACCA TCACGAGTTT TGTAAATCTT CTTTCGGTTC TAGAGCAAAA CCGTCAGACC ACGTGGAGAG AAATTCTTGG AACTGTGGAT GTCAAACCAG ACATTAACCC GGACCTGGAT ATTCCGCTGG AGGAGACAAC AACGCCTGCC GTAGCAGTTG CCAGTATCCC TGCTGATGAA GAGGAAAGCC TTGCAGCATT CAAGCTCTGA
|
Protein sequence | MSTATAIRPK DRDAVIQSLR AGVVPRIGQH LIQVGRVNEI AALIKDIDRI TDNGSSIRFV IGEYGSGKTF FLNLVRAIAL EKRLVTAHAD LNPDRRLHAT GGQSRSLYAE MMRNISTRSK PDGGAMQSIV ERFVTEALKE SKSSGKNPEA VIHENLDILS EMVGGYDFAH VIAAYWKGHD TGNEQLKADA IRWLRGEFTT KTDARNALGV RTIIDDGSFY DHLKLMSRFV RLAGFEGMLI CLDELVNLYK LSNVQARNSN YEQILRILND SLQGTAVGLG FIMGGTPDFL MDTRRGLYSY SALQSRLAEN TFAVKGLVDY SGPVLRLSNL IPEDFYILLG KIRHVFAYGH PTDYLLPDEA LKAFMDHCSK KIGDAYFRTP RNTITSFVNL LSVLEQNRQT TWREILGTVD VKPDINPDLD IPLEETTTPA VAVASIPADE EESLAAFKL
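The relationship between the two sequences above can be spot-checked by translating the ends of the gene sequence. The following is a minimal sketch, not the database's own pipeline: it builds the codon table by hand in the conventional NCBI base order (T, C, A, G). Translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons, which does not matter here because this gene begins with ATG. The function name `translate` and the variable names are hypothetical.

```python
# Codon table in NCBI order: first, second, third base each cycle through T, C, A, G.
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last three 10-base blocks of the gene sequence, copied from the record.
# The last 30 bases start at offset 1320, a multiple of 3, so they are in frame.
first_blocks = "ATGAGTACTG CAACTGCAAT CCGTCCCAAA"
last_blocks = "GAGGAAAGCC TTGCAGCATT CAAGCTCTGA"

print(translate(first_blocks))  # MSTATAIRPK
print(translate(last_blocks))   # EESLAAFKL
```

The two outputs match the first ten and last nine residues of the listed protein sequence, and the final codon TGA is a stop codon. Passing the full gene sequence to the same function would check the remaining residues.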