Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1009 |
Symbol | |
ID | 8136331 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1188998 |
End bp | 1190668 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644868622 |
Product | hypothetical protein |
Protein accession | YP_003020830 |
Protein GI | 253699641 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 116 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATACAAC TTCTGCGCGA TAGGATCACC GCCACGGTAG CTATTTCCTT TCAACAGGGC TCCCTCGGGA GCACCGACCA TATGTCCAAG ACTCTGGCCG CCGTACGCCG CAATTTCGAC AATCTCGGAG GCCCGCCGGT ACAGGGGCGT ACCATAGCCG ATGCGGTCTC CAGCTTTCGC GCCACCGGGA AACTACTTTC CTTCAGCAAC GCCAAATATG TCTGCATCGG CGTCAGTCAC GAATTACCAG ATGGTTGGCG CCTGATTGAA GACGAGGGTA TTTTCTCTAC ACTTATCGAT GAAGTTCGCC TGACGGCCCT CCGCCCACGA CTATTTTTAA GATGCTATCA AGGGCTTCTT TATAATTATT TCAACGTGTA CACTAAGAGC TCTTCCGAGT CTTGCGCCGG CAACTGGCTC GCGCTAAACA ACTTCCTCAA GACAGACCTT AAGCAGGTAC AACAGACAGC TATTCCCCCG ACTTGGTTGA AAATGCTCAC GGCCAACGAG AACCTCCTGG AAGATAATCC CTGCCGCAAG TACGGTAACA TACTGGCGGA GGGCGGACAG AAGGAGTTTA GGGGCATCTG CGAGGCAGTC GGTATTTTAT CTGGTTCCTG GCTTCTGGAG GAGGCAGTCT TCTCACAGGT CGAGGCGATC TGCACCTATG ATGATTCCCC CTTCGCTGAA AAGCTCGACG AAATGTTGCT TCTCTTAGTC GGGCAAGGGA ACGTGAGCTA CTCGAAAAGG CTGATCGTCA GGTGCCTCGC TGCGCTGGCC ATCAGATATT CACGATGCCG GGAACATCCG GACCACAGCA TTCTCCGAGA TACCCTGATT AAGCACATCG GCAATCCTTG GCTGGACAAG ACCGCTTGGG ATGTCTCGGT AAACGATGAG CCTGCCCGGA TCATGGTGGA CAGTTGGCTC AAACGCCACT TGATCCACGC GTTTTTCACT CTCCTGTCAG AGGACGGGGC AACGGATCAG CGGCGCCTGG ATTACTGGCT ACGGTATTAT GAGAGAATAA ACGACCTCTG GTTCGTGTTG GGCAGGGACG CCAGGAAAAA CAGTAGCTCC GACTTCGTTA AGATCCGCGC CCTGGCGAGG GATCACCTGT TGTATCTGGA TGGGGGCGGT GCCTCCAATA ACGCCTTCAT CATGAAAATC GGGGATAAAT ATGCGGTAGA GTTCGGGCTG ACGGGAAACG CCTGCTACAT CTTCAATGAA GACCGGCTGC CTTTCGATCC AACACGCATG CAGCGGACCT ACAACGTAAC AGATCTTAAA TCTAAGTCCC ACGGAAAACA AATGATTCAC AGCGATGGGC ACCAGAGGTG GGAGAGCATC TTCGACGATT ATCTGAGCCC CAGAATCGGC TGGCGACCGG GTACGCCTGT GCAACAGCCC TCGCACACCA GGGCCCACGC ATCCTACGAC CGCCTCAGTA ATCAGCACGT CCAGCCGCCA AAGGTAAATG GGCCGCTCTC CGCAGAGGGT TACCGGGAGG TTTGGCAGAT GGCGGCGGAC CGCTTTTACG GGATGACAGA CAAACGAAGT AGTGGTGGAG GGCTATGGGT GGACGCACCC AATAACATCG CCTTCGTTAG CTCCATTCTG ACGAAGCACG GGTTCAACTT CAGACAGGAT AAAGGCTGGT ATAGGGAGTA A
|
Protein sequence | MIQLLRDRIT ATVAISFQQG SLGSTDHMSK TLAAVRRNFD NLGGPPVQGR TIADAVSSFR ATGKLLSFSN AKYVCIGVSH ELPDGWRLIE DEGIFSTLID EVRLTALRPR LFLRCYQGLL YNYFNVYTKS SSESCAGNWL ALNNFLKTDL KQVQQTAIPP TWLKMLTANE NLLEDNPCRK YGNILAEGGQ KEFRGICEAV GILSGSWLLE EAVFSQVEAI CTYDDSPFAE KLDEMLLLLV GQGNVSYSKR LIVRCLAALA IRYSRCREHP DHSILRDTLI KHIGNPWLDK TAWDVSVNDE PARIMVDSWL KRHLIHAFFT LLSEDGATDQ RRLDYWLRYY ERINDLWFVL GRDARKNSSS DFVKIRALAR DHLLYLDGGG ASNNAFIMKI GDKYAVEFGL TGNACYIFNE DRLPFDPTRM QRTYNVTDLK SKSHGKQMIH SDGHQRWESI FDDYLSPRIG WRPGTPVQQP SHTRAHASYD RLSNQHVQPP KVNGPLSAEG YREVWQMAAD RFYGMTDKRS SGGGLWVDAP NNIAFVSSIL TKHGFNFRQD KGWYRE
|
| |