Gene GM21_3359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3359 
Symbol 
ID8138726 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3886915 
End bp3888597 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content64% 
IMG OID644870977 
ProductABC-1 domain protein 
Protein accessionYP_003023142 
Protein GI253701953 
COG category[R] General function prediction only 
COG ID[COG0661] Predicted unusual protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value0.779537 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTATAGAA TCCTGAACAT CAACCGCAAC GTCCGGAGCA TCCGGCGTTA CCGGCAGATC 
ATCACGGTAA TGGGCGGGTA TGGCCTCGGG CAGTTGCTGG AATACCTGAA CCTGGGGCAG
GTGGTGGCCT TGTCGCGCCG CATGCTGCGC CGTCCCAGCA AGGCGGCTCA CCTCTCGGCG
CCGGAGCGCC TGCGCTTGGC CCTCGAGGAA CTGGGGCCGA CCTTCATCAA GCTGGGACAG
CTCCTCTCCA CCCGCGCCGA CATCATCCCC CCCGCCTTCG TGCAGGAACT GGCGCGCCTG
CAGGACGAGA TACCCTGCAT CGATTTCGAG GAGATAAAGG TACAGATCGA GCATGAGTTG
GGGGTACCGC TGGAAAACCG GTTCCTCCGC GTGGAGCCGG TGGCCATCGC CGGGGCGTCG
ATTGCGCAGG TGCACCGGGC CACGCTCGTC ACCGGGGAGG ACGTGGTGGT GAAGGTGCGC
CGCCCCGGGG TGATGGGGGC GGTCGAGACC GACATCGACA TCCTGATGGG GGTGGCGCTG
CTTTTGGAGC GCCACATGGC CAGAAGCGAC ATCTACGACC CGGTTGGGGT GGTGCGGGAA
TTCTCCTACA CCATCCGGCG CGAAATGGAT CTCTCCCGCG AGGGGCACGC CATCGAGCGT
ATCCGTGACA ACTTCAAGGG GTACCCCGAC CTTCATTTCC CGCAGGTCTA CTGGGAGGCG
ACCGCGAAGG GTGTGCTCAC CACCGAGTAC GTGGACGGCA TCAAGGTGAG CGACATCTGC
GCCATCGAGA AGGCTGGGCT GGACCGGCGC GAGATAGCGC GGCGCGGGGC GACGGCCTTT
CTGAAGATGG TGCTGGAACA CGGCTTCTTC CACGGCGACC CCCATCCGGG GAACGTGATG
ATCCTCCCCA ACAACGTGAT CTGCCTGCTC GACTACGGCA TGGTGGGAAG GCTAGACCCC
GCTGTGAAGC GCTACCTGAC CGACGTCTTG GGCGCGGTGA TCGACCGGGA TGTCGAGGGG
CTCGCCTACA TCGTAGCGGA GGCCGGCGAC GCGGGCGAGA ACGTCAACAT GCACGCGCTG
AAAAAGGGGC TCGCCGAGTT CATCGACAGC TACTTCGACA TCCCGCTCAA GGAGATCGTG
GTGGGGCGCA TGCTCCTGGA GTTCATCGAC CTGGTTTCCA CGCACCGCAT CAAGGTGCAC
CCGGACCTCA CCATGCTGGT CAAGGTGCTG GTGGTGGTGG AGGGGATGGG GAGAAAGCTC
GATCCCGATT TCGACATGGT AGGGCACCTG CGGCCGTTCC TGGAGAGGGA GTTCAGGCAG
CAGCACTCGC CGGGGCGACT TTTGCGCGAG ATGGAGCAGG GGCTGGAGGG ATACCTCACC
CTGGCGCGCA ACCTGCCGCG GGAGCTGAAG GAGATCCTGA ACAAGATCAA CCGGAACAAG
TTCCGCATCG ACCTGGAACA CCGGGGGCTG GACCGTTTCA GTAGGGAGCT CGACCGCTCG
GCGAACCGTG TCTGCCTGAG CCTCATCATA GCCGCGCTGC TGATCGGCTC CTCCATCGCC
ATGCAGACCA ACCGCGGCCC GATGCTCTGG GGGCTCCCCG TATTCGCCTT TTTCGGCTAC
AGCTGCGCCG GAATAGTCGG CATCTGGTGG ATGATCGCCA TCCTCCGCTC CGGCAGACTG
TAG
 
Protein sequence
MYRILNINRN VRSIRRYRQI ITVMGGYGLG QLLEYLNLGQ VVALSRRMLR RPSKAAHLSA 
PERLRLALEE LGPTFIKLGQ LLSTRADIIP PAFVQELARL QDEIPCIDFE EIKVQIEHEL
GVPLENRFLR VEPVAIAGAS IAQVHRATLV TGEDVVVKVR RPGVMGAVET DIDILMGVAL
LLERHMARSD IYDPVGVVRE FSYTIRREMD LSREGHAIER IRDNFKGYPD LHFPQVYWEA
TAKGVLTTEY VDGIKVSDIC AIEKAGLDRR EIARRGATAF LKMVLEHGFF HGDPHPGNVM
ILPNNVICLL DYGMVGRLDP AVKRYLTDVL GAVIDRDVEG LAYIVAEAGD AGENVNMHAL
KKGLAEFIDS YFDIPLKEIV VGRMLLEFID LVSTHRIKVH PDLTMLVKVL VVVEGMGRKL
DPDFDMVGHL RPFLEREFRQ QHSPGRLLRE MEQGLEGYLT LARNLPRELK EILNKINRNK
FRIDLEHRGL DRFSRELDRS ANRVCLSLII AALLIGSSIA MQTNRGPMLW GLPVFAFFGY
SCAGIVGIWW MIAILRSGRL