Gene GM21_0429 details


Gene Information

Locus tag: GM21_0429
Symbol:
ID: 8135738
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 506444
End bp: 507658
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 51%
IMG OID: 644868047
Product: restriction modification system DNA specificity domain protein
Protein accession: YP_003020267
Protein GI: 253699078
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
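
The coordinates and lengths in this record are mutually consistent, which a short calculation can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length(506444, 507658)
print(length)                  # 1215
print(protein_length(length))  # 404
```

Here 1215 bp corresponds to 405 codons, and dropping the stop codon leaves the 404 aa reported for the protein.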


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 129
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGCGA AGGTGAAGCC GGGTTATAAG CAGACCGAGG TGGGTGTGAT TCCGGAGGAG 
TGGGATTGTT GCATGCTTCG GGATGGTATT GTCCTTCTGT CTGGACATCA CATCTTGGCT
CATTACTGCA ATATGAGTGG CTGCGGTGTT CCATACCTTA CGGGGCCGGC AGACTTTCGT
AATGGCGCTA TTGCTAACAC CAAGTTCACG AATAAGCCTG CCACATTATG CAGTGATGGT
GACATTCTTG TTACGGTAAA AGGTTCTGGT TCTGGAACTA TTGTAGTGGC TGACAAAATG
TATTGCATCA GCCGACAACT GATGGCAATT AGGCCGTTGG AATGGAATTC TATATTCCTC
TATTACTCAC TCCTTCAGAA CGCATTACAC TTTAAAGCCG CCTCTGCTGG ACTGATTCCG
GGATTGTCCC GCTCAGACAT TCTAGAACAG TTGGTTCCGT TACCACCGCT CCCCGCACAA
AACACGATTG CGGATGCATT GAGCGATGTA GATGTGTTGC TGGGGGCGCT GGACCGGCTT
ATTGCCAAGA AGCGTGACCT CAAACAGGCC GCCATGCAGC AACTCCTTAC GGGTGAAACC
AGACTCCCTG GGTTTCATGG CGAGTGGGCG GTGAAGCGGT TGGGAGATCT CGGGACCTTC
TTGAAAGGTA ATGGCATCAG GAAAGACGAA GCGATGAGCG GCGCGCTGCC CTGTGTTCGT
TATGGCGAGA TTTACACGCA CCACAACAAT TACGTGAAGT CATTCAACTC TTGGATCTCT
CCAGAGGTGG CTGTCTCTGC AACACGTCTA AAAAAAGGCG ATTTGCTATT TGCTGGCTCT
GGTGAAACCA AGGAGGAAAT CGGAAAATGT GTAGCTTGTA TAGATGATTG CGACGCATAT
GCGGGTGGAG ACATAGTAAT TCTTCGCTTA GCCGCGGCCC ATCCTTTGTT CATGGGCTAT
TACTGCAACA TCGCGACTGT AAATGCTCAG AAGGCCAGTA GAGCACAGGG GGATGCCGTG
GTGCACATTG GTGCGGTTGC CCTGTCCAGT GTGTTGGTTT CAGTTCCGTC AGTAAGTGAG
CAAGTTGCCA TCGCAGAGGT GCTATTCGAC ATGGACGCAG AACTCGCGGG TTTGGAGCAG
CGTCGCGACA AGACTCGTTC CCTAAAGCAG TCCATTATGC AGGAATTACT CACCGGAAAA
ACGCGCCTTA TCTGA
 
Protein sequence
MSAKVKPGYK QTEVGVIPEE WDCCMLRDGI VLLSGHHILA HYCNMSGCGV PYLTGPADFR 
NGAIANTKFT NKPATLCSDG DILVTVKGSG SGTIVVADKM YCISRQLMAI RPLEWNSIFL
YYSLLQNALH FKAASAGLIP GLSRSDILEQ LVPLPPLPAQ NTIADALSDV DVLLGALDRL
IAKKRDLKQA AMQQLLTGET RLPGFHGEWA VKRLGDLGTF LKGNGIRKDE AMSGALPCVR
YGEIYTHHNN YVKSFNSWIS PEVAVSATRL KKGDLLFAGS GETKEEIGKC VACIDDCDAY
AGGDIVILRL AAAHPLFMGY YCNIATVNAQ KASRAQGDAV VHIGAVALSS VLVSVPSVSE
QVAIAEVLFD MDAELAGLEQ RRDKTRSLKQ SIMQELLTGK TRLI
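
The gene sequence begins with GTG rather than ATG, yet the protein begins with M. This is expected under translation table 11, where certain alternative start codons are read as methionine at the initiation position. The following is a minimal sketch of such a translation, not the database's own tooling. It assumes the standard codon assignments (which table 11 shares for elongation) and, as a simplification, treats only ATG, GTG and TTG as initiators. The names `translate` and `COMMON_STARTS` are hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplifying assumption: only the most common bacterial start codons.
COMMON_STARTS = {"ATG", "GTG", "TTG"}


def translate(dna: str, first_is_start: bool = True) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon.

    If first_is_start is True and the first codon is a recognised start
    codon, it is translated as M regardless of its usual meaning.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and first_is_start and codon in COMMON_STARTS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bp, 20 codons).
first_line = "GTGAGCGCGA AGGTGAAGCC GGGTTATAAG CAGACCGAGG TGGGTGTGAT TCCGGAGGAG"
print(translate(first_line))  # MSAKVKPGYKQTEVGVIPEE

# Last line of the gene sequence; it is internal, so no start-codon handling.
print(translate("ACGCGCCTTA TCTGA", first_is_start=False))  # TRLI
```

Both outputs match the corresponding ends of the protein sequence listed above: the first 20 residues and the final four residues before the TGA stop codon.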