Gene GM21_0366 details


Gene Information

Locus tag: GM21_0366
Symbol:
ID: 8135673
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 443943
End bp: 445262
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 61%
IMG OID: 644867983
Product: integrase family protein
Protein accession: YP_003020205
Protein GI: 253699016
COG category:
COG ID:
TIGRFAM ID:
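
The coordinate and length fields in this record are mutually consistent, which a short check makes explicit. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single stop codon at the end of the CDS; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(443943, 445262)
print(length_bp)                  # 1320
print(protein_length(length_bp))  # 439
```

Both results agree with the Gene Length (1320 bp) and Protein Length (439 aa) fields reported for this record.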


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 66
Fosmid unclonability p-value: 0.134449
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAGCG GAATCCGATT TACCGACCTG TACATTAAAA ACCTGAAGCC CATGGAAAAG 
GATTACTGGG CGCGCGAAGG GTTGGGGTTT GCGGTCAGGG TGGTTCCTTC CGGGGAGAAG
CTTTGGTACT ACATTTACAC CTTCCAGGGA AAGAAGCGGT ACATGTGGCT CGGGAGCTAT
CCTGCGGTGC CGCTGGCGGC TGCCCGGGAG GCCTGTGAAG TAGCCAGGGC CAAGGTGAAA
GCCGGCACAG ACCCATTAGC GCAGAAGGAC GCGGAGCTGG AGGAGCGGCG CAAGGCTCCC
ACGGTTGCCG ATCTGTGCGC CGAGTACCTG GAGCGCCACG CCAAGCAGTT CAAGCGCTCC
TGGCAGAAAG ATGAGCAGAT GATAAAGCGG GACGTACTGC CGGAGTGGGG CAAGCGGAAG
GCTCGGGACA TCACCAAAAG GGACGTGGTG CTGCTCTTGG AAAAGATCAT GGATCGCGGT
GCGCCGATAC AGGCCAACAC CACCTTCGCC CTAATCCGCA AGATGTTCAA CTTCGCGGTG
GAGCGGGACG TCCTGGAACA CACCCCCTGC CATGGCGTCA AGCCTCCGGC GCCGAAGGTG
GCCCGGGACC GGGTCCTCTC GGAAAGTGAG ACCAGGTCCT TCTGGCACAA CCTGGACGCC
TGCTGCATGT CCAATGAAAC CAGGCGCGCC TTGAAGCTGG TGCTGGTCAC CGCGCAGCGG
CCGGGCGAAG TGATCGGCAT GCATACCGAC GAGATAAAGG GGGAGTGGTG GATACTGCCT
GGAGATAGGG TGAAAAACAA GAAATCCCAC CGGGTGTACC TGTCGACGCT CGCCAGAGAG
ATCCTGGCGG AGGCTGTCGC TGAGAACAAG GAAAAGCTCG GCATCCCGGG GGACCAGGAG
TATCGCGGCT TCATGTTTCC TTCCCCCCAA CTTGCCAAAG TGCAGCCCAT AGCTCCGCAG
GCGCTGATCG TGGCGGTGGG GCGGGCCCTT GCTTCCCCGG TGCTCGATCC GAACTTCAAA
CGGGTCCTCG ACCGGGAGGG GAAGCCCGTC ACGGTAAACC GGCTCGAGGT CGCCCACTTC
ACCCCGCACG ATCTGAGGCG CACCGCGGCA ACGTTCATGG CAGAATCCGG CGAGATGGAT
GAGGTGATCG ACGCCGTTTT GAACCACGCC AAGCAGGGGG TGATCAGGGT CTACAACCAG
TTCAAGTACG ACGCGCAGAA GCAAGCGGCG CTCGAATCTT GGTCCAGGAG GCTCATCTGC
ATCACCACAG GCGTGAAGGG AAAGGTGATC GCCATCGGCA GCCGGTCCAA CTCGGCGTAA
 
Protein sequence
MKSGIRFTDL YIKNLKPMEK DYWAREGLGF AVRVVPSGEK LWYYIYTFQG KKRYMWLGSY 
PAVPLAAARE ACEVARAKVK AGTDPLAQKD AELEERRKAP TVADLCAEYL ERHAKQFKRS
WQKDEQMIKR DVLPEWGKRK ARDITKRDVV LLLEKIMDRG APIQANTTFA LIRKMFNFAV
ERDVLEHTPC HGVKPPAPKV ARDRVLSESE TRSFWHNLDA CCMSNETRRA LKLVLVTAQR
PGEVIGMHTD EIKGEWWILP GDRVKNKKSH RVYLSTLARE ILAEAVAENK EKLGIPGDQE
YRGFMFPSPQ LAKVQPIAPQ ALIVAVGRAL ASPVLDPNFK RVLDREGKPV TVNRLEVAHF
TPHDLRRTAA TFMAESGEMD EVIDAVLNHA KQGVIRVYNQ FKYDAQKQAA LESWSRRLIC
ITTGVKGKVI AIGSRSNSA
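
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below illustrates this on the first and last 60-base lines of the gene sequence only. It assumes the standard codon assignments, which translation table 11 shares for internal codons; the helper names `CODON_TABLE`, `clean`, and `translate` are hypothetical and chosen for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G at each position).
# Translation table 11 uses these same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon; '*' marks a stop codon."""
    dna = clean(dna)
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# First and last 60-base lines of the gene sequence, copied from this record.
first_line = "ATGAAGAGCG GAATCCGATT TACCGACCTG TACATTAAAA ACCTGAAGCC CATGGAAAAG"
last_line = "ATCACCACAG GCGTGAAGGG AAAGGTGATC GCCATCGGCA GCCGGTCCAA CTCGGCGTAA"

print(translate(first_line))  # MKSGIRFTDLYIKNLKPMEK
print(translate(last_line))   # ITTGVKGKVIAIGSRSNSA*
```

The two outputs match the first 20 and the last 19 residues of the protein sequence, with the trailing `*` corresponding to the TAA stop codon that ends the gene.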