Gene GM21_4066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4066 
Symbol 
ID8139440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4650002 
End bp4651513 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content62% 
IMG OID644871682 
ProductIntegrase catalytic region 
Protein accessionYP_003023840 
Protein GI253702651 
COG category[L] Replication, recombination and repair 
COG ID[COG4584] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones133 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAGACCA TCCGTAAAGT CCGTAAAGCC CATTTCGTAG ACGGCAAGGG AATTCGCCAA 
ATAGTCCGGG AGTTCAAGCT CGCCAGGAAC ACCGTCCGGG ACATCATCCG CAGCGGCAAG
ACCGATCAGA AGTATGAACG CAGCAAGCAG CCGCGCCCCA AGCTGGGGTT GTTCGCCGAC
CGGGTGTCAG AGCTGCTGAC GGACGATAGC GCCAAGCCGG TCAAGCATCG CCGCAGTGCA
CAGATCCTCT TCGAGCAGCT GCAGCGGGAA GGATACGAGG GCGGCTATGA CACCCTGCGG
CGCTATGTCG CCGCCTGGAA GCAATCAAAG GAAGCCGCTA CCGTCAGGGC GTTTATCCCG
CTGGCATACG ATCCCGGAGA CGCCTTTCAG TTCGACTGGA GCTACGAGTC GGTAGAGTTG
GGCGGCGTCC CGGTCGAAGT GAAGATCGCC CAGTTCCGCC TATGCCACAG CCGCAAGCCA
TACTGCGTCG GCTACACCCG TGAAAGCCTG GAGATGGTGC TTGACGCCCA CGTTCGGGCT
TTCGAGTTCT TCGGCGGTGT CTGCAAACGT GGCATCTACG ATAACCTGAA GACGGTCGTC
ACGAAGGTGC TGATAGGCAA AGACCGCGTC TTCAATCGGC GCTTCCAGAA CCTCGCCTCG
CATTACCTGT TCGATCCGGT GGCCTGTACC CCTGCAGCCG GGTGGGAGAA AGGACAGGTG
GAGAATCAGG TCGGCGTGGT AAGGAACCGC TTCTTCGCCA AGCGCAGGCG CTTTGCCGAT
CTCTCCGAGT TGAACGAGTG GCTGGAACAG GAATGCCGCA ACCATGGTGC AGCGGCGCGG
CATCCCGAGC GCAAGGACAG GACCATCGAC GAGGTGTTCG CCGAGGAGAA GGGCCATCTG
CTGACGCTGC CGGCGTCCCC CTTCGACGGC TACCAGGAGA GTGTAGCCCG CGTCTCGTCG
CAGTTGCTGA TCAGCTTCGA CCGGAACCGC TACAGCGTCA ACGCCATGGC CGTCGGCAAG
ACCGTTGCGG TGCGGGCCTA CGCCGACCGG ATCATCATGG TGCTCAATGA CAGGGTTGTC
GCCATGCACC GGCGCTACCT CGGCCGCGAC AAGGTCATCT ACGATCCCTG GCATTACATT
GCCGTGCTGG AGCAGAAGCC GGGCGCCTTG CGAAACGGCG CCCCCTTCAA GGGCTGGAAC
CTGCCGCCGT CTTTGCTGGA GGTCAAGACG AACCTGGAGG GACGTCCCGA CGGCGACCGC
CAGTTCGTGG CCATCTTGAG CGCCGTCCGT CGTTACAGCC TCGATGCCGT CGCTGAGGCG
TGCAGCCAGG CGCTAGTTGA TAAGACAGTA AGTTCCGATG TCATCCTCGC CATACTGTCC
CGAAAGCACG ACGAGCCGCA GCCGGGACCG GTCCAGGAGA CGGCGAACCT GCCGCAGCTC
ACATTGGTAC CCATAGTGGA CTGCCATCGC TATGACCTCC TTCTCTCCGG AGGTGCCCAT
GGGACTGCGT GA
 
Protein sequence
METIRKVRKA HFVDGKGIRQ IVREFKLARN TVRDIIRSGK TDQKYERSKQ PRPKLGLFAD 
RVSELLTDDS AKPVKHRRSA QILFEQLQRE GYEGGYDTLR RYVAAWKQSK EAATVRAFIP
LAYDPGDAFQ FDWSYESVEL GGVPVEVKIA QFRLCHSRKP YCVGYTRESL EMVLDAHVRA
FEFFGGVCKR GIYDNLKTVV TKVLIGKDRV FNRRFQNLAS HYLFDPVACT PAAGWEKGQV
ENQVGVVRNR FFAKRRRFAD LSELNEWLEQ ECRNHGAAAR HPERKDRTID EVFAEEKGHL
LTLPASPFDG YQESVARVSS QLLISFDRNR YSVNAMAVGK TVAVRAYADR IIMVLNDRVV
AMHRRYLGRD KVIYDPWHYI AVLEQKPGAL RNGAPFKGWN LPPSLLEVKT NLEGRPDGDR
QFVAILSAVR RYSLDAVAEA CSQALVDKTV SSDVILAILS RKHDEPQPGP VQETANLPQL
TLVPIVDCHR YDLLLSGGAH GTA