Gene Information |
Locus tag | GM21_4066 |
Symbol | |
ID | 8139440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4650002 |
End bp | 4651513 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644871682 |
Product | Integrase catalytic region |
Protein accession | YP_003023840 |
Protein GI | 253702651 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4584] Transposase and inactivated derivatives |
TIGRFAM ID | |
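The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4650002
end_bp = 4651513
gene_length_bp = 1512
protein_length_aa = 503

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinate span vs. Gene Length
print(expected_protein_length == protein_length_aa)  # codon count vs. Protein Length
```

Under these assumptions both checks hold for this record: the span is 1512 bp, and 504 codons minus one stop codon gives 503 amino acids.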
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 133 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGGAGACCA TCCGTAAAGT CCGTAAAGCC CATTTCGTAG ACGGCAAGGG AATTCGCCAA ATAGTCCGGG AGTTCAAGCT CGCCAGGAAC ACCGTCCGGG ACATCATCCG CAGCGGCAAG ACCGATCAGA AGTATGAACG CAGCAAGCAG CCGCGCCCCA AGCTGGGGTT GTTCGCCGAC CGGGTGTCAG AGCTGCTGAC GGACGATAGC GCCAAGCCGG TCAAGCATCG CCGCAGTGCA CAGATCCTCT TCGAGCAGCT GCAGCGGGAA GGATACGAGG GCGGCTATGA CACCCTGCGG CGCTATGTCG CCGCCTGGAA GCAATCAAAG GAAGCCGCTA CCGTCAGGGC GTTTATCCCG CTGGCATACG ATCCCGGAGA CGCCTTTCAG TTCGACTGGA GCTACGAGTC GGTAGAGTTG GGCGGCGTCC CGGTCGAAGT GAAGATCGCC CAGTTCCGCC TATGCCACAG CCGCAAGCCA TACTGCGTCG GCTACACCCG TGAAAGCCTG GAGATGGTGC TTGACGCCCA CGTTCGGGCT TTCGAGTTCT TCGGCGGTGT CTGCAAACGT GGCATCTACG ATAACCTGAA GACGGTCGTC ACGAAGGTGC TGATAGGCAA AGACCGCGTC TTCAATCGGC GCTTCCAGAA CCTCGCCTCG CATTACCTGT TCGATCCGGT GGCCTGTACC CCTGCAGCCG GGTGGGAGAA AGGACAGGTG GAGAATCAGG TCGGCGTGGT AAGGAACCGC TTCTTCGCCA AGCGCAGGCG CTTTGCCGAT CTCTCCGAGT TGAACGAGTG GCTGGAACAG GAATGCCGCA ACCATGGTGC AGCGGCGCGG CATCCCGAGC GCAAGGACAG GACCATCGAC GAGGTGTTCG CCGAGGAGAA GGGCCATCTG CTGACGCTGC CGGCGTCCCC CTTCGACGGC TACCAGGAGA GTGTAGCCCG CGTCTCGTCG CAGTTGCTGA TCAGCTTCGA CCGGAACCGC TACAGCGTCA ACGCCATGGC CGTCGGCAAG ACCGTTGCGG TGCGGGCCTA CGCCGACCGG ATCATCATGG TGCTCAATGA CAGGGTTGTC GCCATGCACC GGCGCTACCT CGGCCGCGAC AAGGTCATCT ACGATCCCTG GCATTACATT GCCGTGCTGG AGCAGAAGCC GGGCGCCTTG CGAAACGGCG CCCCCTTCAA GGGCTGGAAC CTGCCGCCGT CTTTGCTGGA GGTCAAGACG AACCTGGAGG GACGTCCCGA CGGCGACCGC CAGTTCGTGG CCATCTTGAG CGCCGTCCGT CGTTACAGCC TCGATGCCGT CGCTGAGGCG TGCAGCCAGG CGCTAGTTGA TAAGACAGTA AGTTCCGATG TCATCCTCGC CATACTGTCC CGAAAGCACG ACGAGCCGCA GCCGGGACCG GTCCAGGAGA CGGCGAACCT GCCGCAGCTC ACATTGGTAC CCATAGTGGA CTGCCATCGC TATGACCTCC TTCTCTCCGG AGGTGCCCAT GGGACTGCGT GA
Protein sequence | METIRKVRKA HFVDGKGIRQ IVREFKLARN TVRDIIRSGK TDQKYERSKQ PRPKLGLFAD RVSELLTDDS AKPVKHRRSA QILFEQLQRE GYEGGYDTLR RYVAAWKQSK EAATVRAFIP LAYDPGDAFQ FDWSYESVEL GGVPVEVKIA QFRLCHSRKP YCVGYTRESL EMVLDAHVRA FEFFGGVCKR GIYDNLKTVV TKVLIGKDRV FNRRFQNLAS HYLFDPVACT PAAGWEKGQV ENQVGVVRNR FFAKRRRFAD LSELNEWLEQ ECRNHGAAAR HPERKDRTID EVFAEEKGHL LTLPASPFDG YQESVARVSS QLLISFDRNR YSVNAMAVGK TVAVRAYADR IIMVLNDRVV AMHRRYLGRD KVIYDPWHYI AVLEQKPGAL RNGAPFKGWN LPPSLLEVKT NLEGRPDGDR QFVAILSAVR RYSLDAVAEA CSQALVDKTV SSDVILAILS RKHDEPQPGP VQETANLPQL TLVPIVDCHR YDLLLSGGAH GTA
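The sequences above are printed in space-separated groups of ten characters, so they need cleaning before any computation. The sketch below is illustrative and not part of the original record. It strips the grouping, computes a GC fraction, and translates DNA using the standard amino acid assignments, which translation table 11 shares with the standard code. The helper names (clean_sequence, gc_fraction, translate) are hypothetical, and reading the first codon as methionine is an assumption about how the GTG start codon is handled. Only the first three groups of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Standard amino acid assignments, codons ordered TTT, TTC, TTA, TTG, TCT, ...
# Translation table 11 uses the same assignments; it differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the whitespace between ten-character groups and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a cleaned DNA sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq, first_codon_as_met=True):
    """Translate a cleaned DNA sequence, stopping at the first stop codon.

    Assumption: the first codon is an initiator and is read as M,
    even when it is an alternative start codon such as GTG.
    """
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    if first_codon_as_met and protein:
        protein[0] = "M"
    return "".join(protein)


# First three groups of the gene sequence, as printed in the record.
fragment = clean_sequence("GTGGAGACCA TCCGTAAAGT CCGTAAAGCC")
print(len(fragment), round(gc_fraction(fragment), 3), translate(fragment))
```

For this 30-base fragment the translation is METIRKVRKA, which matches the first ten residues of the protein sequence. Passing the full gene sequence through the same helpers would let the GC content and Protein Length fields be compared against the record.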