Gene GM21_2884 details


Gene Information

Locus tag: GM21_2884
Symbol:
ID: 8138227
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3354575
End bp: 3356086
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 62%
IMG OID: 644870485
Product: Integrase catalytic region
Protein accession: YP_003022674
Protein GI: 253701485
COG category: [L] Replication, recombination and repair
COG ID: [COG4584] Transposase and inactivated derivatives
TIGRFAM ID:
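
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only; it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 3354575
end_bp = 3356086
gene_length_bp = 1512
protein_length_aa = 503

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# One codon is three bases; the last codon is assumed to be the stop codon,
# which does not contribute a residue to the protein.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches gene length
print(gene_length_bp % 3 == 0)                   # whole number of codons
print(expected_protein_aa == protein_length_aa)  # protein length is consistent
```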


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 7.53685e-20
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGAGACCA TCCGTAAAGT CCGTAAAGCC CATTTCGTAG ACGGCAAGGG AATTCGCCAA 
ATAGTCCGGG AGTTCAAGCT CGCCAGGAAC ACCGTCCGGG ACATCATCCG CAGCGGCAAG
ACCGATCAGA AGTATGAACG CAGCAAGCAG CCGCGCCCCA AGCTGGGGTT GTTCGCCGAC
CGGGTGTCAG AGCTGCTGAC GGACGATAGC GCCAAGCCGG TCAAGCATCG CCGCAGTGCA
CAGATCCTCT TCGAGCAGCT GCAGCGGGAA GGATACGAGG GCGGCTATGA CACCCTGCGG
CGCTATGTCG CCGCCTGGAA GCAATCAAAG GAAGCCGCTA CCGTCAGGGC GTTTATCCCG
CTGGCATACG ATCCCGGAGA CGCCTTTCAG TTCGACTGGA GCTACGAGTC GGTAGAGTTG
GGCGGCGTCC CGGTCGAAGT GAAGATCGCC CAGTTCCGCC TATGCCACAG CCGCAAGCCA
TACTGCGTCG GCTACACCCG TGAAAGCCTG GAGATGGTGC TTGACGCCCA CGTTCGGGCT
TTCGAGTTCT TCGGCGGTGT CTGCAAACGT GGCATCTACG ATAACCTGAA GACGGTCGTC
ACGAAGGTGC TGATAGGCAA AGACCGCGTC TTCAATCGGC GCTTCCAGAA CCTCGCCTCG
CATTACCTGT TCGATCCGGT GGCCTGTACC CCTGCAGCCG GGTGGGAGAA AGGACAGGTG
GAGAATCAGG TCGGCGTGGT AAGGAACCGC TTCTTCGCCA AGCGCAGGCG CTTTGCCGAT
CTCTCCGAGT TGAACGAGTG GCTGGAACAG GAATGCCGCA ACCATGGTGC AGCGGCGCGG
CATCCCGAGC GCAAGGACAG GACCATCGAC GAGGTGTTCG CCGAGGAGAA GGGCCATCTG
CTGACGCTGC CGGCGTCCCC CTTCGACGGC TACCAGGAGA GTGTAGCCCG CGTCTCGTCG
CAGTTGCTGA TCAGCTTCGA CCGGAACCGC TACAGCGTCA ACGCCATGGC CGTCGGCAAG
ACCGTTGCGG TGCGGGCCTA CGCCGACCGG ATCATCATGG TGCTCAATGA CAGGGTTGTC
GCCATGCACC GGCGCTACCT CGGCCGCGAC AAGGTCATCT ACGATCCCTG GCATTACATT
GCCGTGCTGG AGCAGAAGCC GGGCGCCTTG CGAAACGGCG CCCCCTTCAA GGGCTGGAAC
CTGCCGCCGT CTTTGCTGGA GGTCAAGACG AACCTGGAGG GACGTCCCGA CGGCGACCGC
CAGTTCGTGG CCATCTTGAG CGCCGTCCGT CGTTACAGCC TCGATGCCGT CGCTGAGGCG
TGCAGCCAGG CGCTAGTTGA TAAGACAGTA AGTTCCGATG TCATCCTCGC CATACTGTCC
CGAAAGCACG ACGAGCCGCA GCCGGGACCG GTCCAGGAGA CGGCGAACCT GCCGCAGCTC
ACATTGGTAC CCATAGTGGA CTGCCATCGC TATGACCTCC TTCTCTCCGG AGGTGCCCAT
GGGACTGCGT GA
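
The GC content reported for this gene is the fraction of G and C bases in the sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted as shown here, in space-separated blocks of ten bases. The function name `gc_fraction` is hypothetical and is not part of any database tool.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters (for example the spaces between
    ten-base blocks) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return gc_count / len(bases)


# Example on the first ten-base block of the gene sequence only.
print(gc_fraction("GTGGAGACCA"))
```

Passing the full gene sequence to the function gives a value that can be compared with the GC content field.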
 
Protein sequence
METIRKVRKA HFVDGKGIRQ IVREFKLARN TVRDIIRSGK TDQKYERSKQ PRPKLGLFAD 
RVSELLTDDS AKPVKHRRSA QILFEQLQRE GYEGGYDTLR RYVAAWKQSK EAATVRAFIP
LAYDPGDAFQ FDWSYESVEL GGVPVEVKIA QFRLCHSRKP YCVGYTRESL EMVLDAHVRA
FEFFGGVCKR GIYDNLKTVV TKVLIGKDRV FNRRFQNLAS HYLFDPVACT PAAGWEKGQV
ENQVGVVRNR FFAKRRRFAD LSELNEWLEQ ECRNHGAAAR HPERKDRTID EVFAEEKGHL
LTLPASPFDG YQESVARVSS QLLISFDRNR YSVNAMAVGK TVAVRAYADR IIMVLNDRVV
AMHRRYLGRD KVIYDPWHYI AVLEQKPGAL RNGAPFKGWN LPPSLLEVKT NLEGRPDGDR
QFVAILSAVR RYSLDAVAEA CSQALVDKTV SSDVILAILS RKHDEPQPGP VQETANLPQL
TLVPIVDCHR YDLLLSGGAH GTA
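
The protein sequence starts with M although the gene sequence starts with GTG, which normally codes for valine. Under translation table 11 (bacterial), GTG can act as a start codon, and an initiating codon is read as methionine. The sketch below illustrates this on short fragments of the gene. It is a simplified illustration, not a replacement for a full translation tool: the set of start codons is a common subset chosen as an assumption, and the names `translate_cds` and `START_CODONS` are hypothetical.

```python
# Codon table built from the standard base ordering T, C, A, G.
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a common subset of bacterial start codons, for illustration.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, reading a recognised first codon as M."""
    seq = "".join(seq.split()).upper()  # drop spaces between ten-base blocks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"  # initiator codon is read as methionine
    if residues and residues[-1] == "*":
        residues.pop()  # the stop codon adds no residue
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate_cds("GTGGAGACCA TCCGTAAAGT CCGTAAAGCC"))
print(translate_cds("GGGACTGCGT GA"))
```

The two fragments translate to the first ten and last three residues of the protein sequence listed above.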