Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2970 |
Symbol | |
ID | 8138313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 3451871 |
End bp | 3453055 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 644870568 |
Product | transposase IS4 family protein |
Protein accession | YP_003022757 |
Protein GI | 253701568 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.0000012671 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGACACT CCCTTAGGAC TTCCACCCTC CGTGCCCGTC TAAGATTCAA AAAGGCTTTT AAGCCCATGT ACGATCCGGT ACAAAGGTAC CTTAGCGTCA TTACTCCGCT TACCTCCAAA GGGGATCGTC CATTGCAACT CTCTTTCGAA GACCAGCTTA AGGCCCTTAT CTACTTCCAC CTTCACGAGT TCTCCTCCGG TAGGGAACTG TTACAAGCTC TGGAGCAGGA CGATTTCGCT AAAGAGTGTG TCGCACCACC CAAGGGAATT AAGAAAAGCG CTTTCTTTGA AGCTGTCAAC AACCGTGGCT TGGAACAACT AGCAGAGCTT TTTAAGCTTT TACTTAAGGA CGCTAAAAAC GTCATTCCAG CCGAGTTTGC AGATATTGGA AATTTAGTTG CCATAGACGG TTCATATATC GACGCCGTCA TGTCTATGGA TTGGGCGGAT TATTCAAGCA CCCACAACAA AGCCAAAGCC CATGTTGCGT TTGATATAAA CCGTGGCATC CCGAAAGATT TAATCCTCAC CGACGGCAAC CAGACCGAAC GCCAATTCGT AGAAAGGATG ATCGGCCCCG ATGAAACGGC TGTATTGGAT CGAGGCTATC AGTGCAACGC TAACTTTGAT CAGTGGCAGG AAAACGAAAA AAAGTTCATC TGCCGCATCC AAGCAAGATC TAACAAAAAA GTCATACGCG AAAACCCCAT CGCGCGGGGT AGCATCATCT TCTACGATGC TGTTGTCCTT CTCGGAGCAC CATCTACTAG GGCGAAAAAA GAGGTTCGCG TGGTTGCTTA CCGGGTTGAG GGCAAAGATT TCTGGATTGC GACTAACCGT CATGATTTGA CTGCGCTGCA AATCGCTGAG GCTTACAAGC TGCGTTGGCA CATTGAGAGC TTTTTCGCAT GGTGGAAACG ACATCTCAGC GTTTACCATC TCATCGCCAG GAGCCAGTAT GGCTTAACAG TCCAAATACT CAGTGGGCTC ATCACCTACC TGCTTCTGGC GATGTACTGC CAGCGAGAAC ACAACGAGCC AGTAAGTGTT CACCGTGTTC GGGAACTGCG GCATCAAATG GCCCGCGACG CAGTCGCAAT GACATCACAA ACGCCGTCTC CCAAAAGGGC AAAACTGCAA AGAAACACAC GATTGCTGAT GAAGCGACGT AAAGCAAAAA CCTAA
|
Protein sequence | MRHSLRTSTL RARLRFKKAF KPMYDPVQRY LSVITPLTSK GDRPLQLSFE DQLKALIYFH LHEFSSGREL LQALEQDDFA KECVAPPKGI KKSAFFEAVN NRGLEQLAEL FKLLLKDAKN VIPAEFADIG NLVAIDGSYI DAVMSMDWAD YSSTHNKAKA HVAFDINRGI PKDLILTDGN QTERQFVERM IGPDETAVLD RGYQCNANFD QWQENEKKFI CRIQARSNKK VIRENPIARG SIIFYDAVVL LGAPSTRAKK EVRVVAYRVE GKDFWIATNR HDLTALQIAE AYKLRWHIES FFAWWKRHLS VYHLIARSQY GLTVQILSGL ITYLLLAMYC QREHNEPVSV HRVRELRHQM ARDAVAMTSQ TPSPKRAKLQ RNTRLLMKRR KAKT
|
| |