Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0333 |
Symbol | |
ID | 8135640 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 413991 |
End bp | 414860 |
Gene Length | 870 bp |
Protein Length | 289 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644867950 |
Product | hypothetical protein |
Protein accession | YP_003020172 |
Protein GI | 253698983 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0177] Predicted EndoIII-related endonuclease |
TIGRFAM ID | [TIGR02757] conserved hypothetical protein TIGR02757 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 3.3662299999999996e-20 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGAAGCT CGCTGGATCT TAAAATCGTA CTGGAATCGC TTTATGCCGC GCGCTCGAAG GAGCACCTCG CCAACGACCC GCTTTCGTTC TGCCACCGCT ACGCAGACCC GGGGGACCAG GAGATTGCCG GCCTCATCGC CTCATCGTTC GCCTACGGCA ACGTGAAGAT CATCAAGAAG AACCTCGCCT GGATCTTCGA CCGGATGGGG GGCTCCCCGC GCCTTTTCGT GGAGCGGTTC GAGCCGGAGC AAGGCTCCCG CCTCTTCGCC GGTTTCAAGC ACCGCTTCAA CGACGCCCGC GACCTCTGCG CGCTCCTTTT CGCCTGCCGC ATCATGCTGG AGCAGGCGGG GTCCGTGGAG GGATGGCTGC TTCAGTTCCA CGGCCAGGGG GAGGAGGACC TGACCGGCAC CCTCACCGGG TTCAGCGACG CGGTGAAGTC CCTCGATTTC ACCCCGGTCT TCGGCGCCCC CTCCCCTCCC GCCGACTCCT ACTTTCCCTT CTTCTTCCCC TCCCCCGCCT CGGGGAGCGC CTGCAAAAGG CTCTGCATGT ACCTCAGATG GATGGTGCGG CCGGCCGACG GCATCGACCT GGGGATCTGG AAGGGGATCA GGCCGGACCA GTTGGTCATC CCGGTGGACG CCCACATCCA GCGCATCTGC AGGCTCCTCG GGTTCACCGC CCGCAAGCAG GCGGACTGGC GCATGGCCCG CGAGATCACG GCCGCGCTGC GCGAGCTCGA CCCGGCGGAC CCGGTCAAAT ACGACTTCTC CATCTGCCAC CTGGGTATTT CCGAGGGTTG CGACGGCAAG GACCACTTGA AGTGCGTAGC CTGCCCCATC GCCGGGCTCT GCCCGGTGGG GTCGCCTTGA
|
Protein sequence | MRSSLDLKIV LESLYAARSK EHLANDPLSF CHRYADPGDQ EIAGLIASSF AYGNVKIIKK NLAWIFDRMG GSPRLFVERF EPEQGSRLFA GFKHRFNDAR DLCALLFACR IMLEQAGSVE GWLLQFHGQG EEDLTGTLTG FSDAVKSLDF TPVFGAPSPP ADSYFPFFFP SPASGSACKR LCMYLRWMVR PADGIDLGIW KGIRPDQLVI PVDAHIQRIC RLLGFTARKQ ADWRMAREIT AALRELDPAD PVKYDFSICH LGISEGCDGK DHLKCVACPI AGLCPVGSP
|
| |