Gene GM21_0369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0369 
Symbol 
ID8135676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp445985 
End bp447052 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content58% 
IMG OID644867986 
ProductIntegrase catalytic region 
Protein accessionYP_003020208 
Protein GI253699019 
COG category[L] Replication, recombination and repair 
COG ID[COG2801] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value0.375547 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCAGC TAGTTGAAGA AGCTCATGCT TCTGGTGCGC GCCTTGAACC AGCGTGCAGA 
CTCGCAGGTA TCGACGTGCG GACCCTGCAG AGGTGGAAAT CGGCCAACGG TCTTGTAGCT
GGCGACAAAC GACCTGAAGC CATCCATCCG CGTCCCGCAC ATGCGCTGTC AGTTACCGAA
CGGGAGGCCA TTCTTAGCAT TGCAAACGAG CCCCGCTTTG CTGAGGCACC ACCGGCGCGA
ATCGTGCCGG TACTTGCCGA TGAAGGCGTT TATGTCGCCA GTGAATCAAG CTTCCACCGT
GTCCTACGAG ACGCTGGGCA GTCCGGTCAT CGGGGGCGTG CAAAAACGCC TCGAAAATCG
AGGCCACCAA CCACTCATGT CGCCACGGCC CCTAACCAGG TCTGGTGCTG GGATATGACC
TTCTTGCCAG CGACGGTCAT CGGGTGCTGG TTTTACATGT ACCTTATTCT AGATCTTTAC
AGCAGAAAGA TCGTCGGGTG GGAGGTCCAC GCCAGCGACG ACTCCGACCA TGCCGCCCAT
CTGGTGCGAC GCACTGCGTT GACGGAAAGC ATCCATTCCG AGCAGACAAA ACCGGTTCTG
CACGGCGACA ACGGATCGAC CTTGAAGGCC ACCACGGTTC TGGCGATGCT CCATTGGCTA
GGGATCGAAC CTTCTTACTC ACGGCCGAGA GTTTCAGACG ACAACGCTTA TGCGGAGGCG
CTGTTCCGAA CTGCCAAATA CCGCCCGGCC TTTCCTGCTG GCGGGTTCAC AGATCTTGAG
GCAGCCCGGC TCTGGGGAGC GCAGTTCGCT CAGTGGTACA ACTTCGAACA TCGCCATAGC
GGAATTCAGT ATGTCACGCC TGCGCAACGC CACGCAGGTG AGGACCGGGA TATCCTCATG
GCGCGCCACG ACCTCTATAC CAAGGCAAAA GAAGCCAAAC CGAGCCGATG GTCACGCAAC
ACTCGTGACT GGACTCCAGT AGGCGCGGTG ACACTCAATC CAGAACGCGA AGCGGTCGTC
AGCACAGCGG TCTTGCAGAC AGAGAATAAA ACCAAAGAAG CGGCATGA
 
Protein sequence
MAQLVEEAHA SGARLEPACR LAGIDVRTLQ RWKSANGLVA GDKRPEAIHP RPAHALSVTE 
REAILSIANE PRFAEAPPAR IVPVLADEGV YVASESSFHR VLRDAGQSGH RGRAKTPRKS
RPPTTHVATA PNQVWCWDMT FLPATVIGCW FYMYLILDLY SRKIVGWEVH ASDDSDHAAH
LVRRTALTES IHSEQTKPVL HGDNGSTLKA TTVLAMLHWL GIEPSYSRPR VSDDNAYAEA
LFRTAKYRPA FPAGGFTDLE AARLWGAQFA QWYNFEHRHS GIQYVTPAQR HAGEDRDILM
ARHDLYTKAK EAKPSRWSRN TRDWTPVGAV TLNPEREAVV STAVLQTENK TKEAA