Gene GM21_0186 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0186 
Symbol 
ID8135489 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp221272 
End bp222258 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content63% 
IMG OID644867805 
Productserine/threonine protein kinase 
Protein accessionYP_003020029 
Protein GI253698840 
COG category[R] General function prediction only 
COG ID[COG2334] Putative homoserine kinase type II (protein kinase fold) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.00000000000000117305 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAGAAA CATCGCATCC CTTTTCCACC CTCACCCCCA ACTTCATCAT GGACGCCGTC 
GAAAGCCAGG GATTCCGTTG CGACTGCCGC ACTTCCGCCT TGAACAGTTA CGAGAACCGC
GTGTACCAAG TGGGGATCGA AGAAGAAAAA CCGCTGATCG CGAAGTTTTA CCGCCCCGGG
CGCTGGAGCG ACGAGCAGAT CAGGGAGGAG CACCAGTTCT GCCTCGAACT GGCGGAACAC
GAGCTGTCCG TGGTCGCTCC TTGGATGAAC CCAGCTGGCG ATACCCTCTT CCATTTCGAC
GGGTTCCGGT TCGCCCTCTA CCCGCGCCAA GGGGGGCACG CCCCCGAGTT CGACAACGAC
GAGAACCTGG CGATCCTCGG TAGAATGCTG GGGCGCATTC ACAGCATCGG CGCCATACGC
CCCTTCAAGG AGCGCCCCAC CCTGGAAAGC CGCAGCTTCG GGCACGACAG CGTAGCCCTC
ATCAAAGAAC GCTTCATCCC TGAGGAATAC CGCGCAAGCT ACACGGCGGT CACCGACCAG
CTGCTTGCCG CCATCGATGC GGCCTTCGCA CAGACGCAGG GGGTGACCCA GATCAGGGCG
CATGGAGATT GCCATGCCGG CAACATCCTG TGGCGGGACG GCGCGCCGCA TTTCGTCGAC
TTCGACGACG CCCGCATGGC GCCGGCGGTG CAGGACCTCT GGATGATGCT TTCGGGTGAG
CGGCCGCGCC AGCTGGTGCA ACTGGAACAA CTGGTGAAGG GATACACCGA ATTCCGCGAC
TTCCACCCCG GAGAACTCAT GCTGGTGGAG CCGCTGCGCG CCCTGCGCAT GCTGCACTAC
AGCGCCTGGC TGGCCCGGCG CTGGGAGGAT CCCACCTTCC CTATCACCTT CCCCTGGTTC
AACACGGTGC GCTACTGGGG CGAGCACATC CTGCAGCTGC GCGAGCAGTT GTCTGCGCTC
GACGAGCCGC CCCTGGAACT TCCTTGA
 
Protein sequence
MKETSHPFST LTPNFIMDAV ESQGFRCDCR TSALNSYENR VYQVGIEEEK PLIAKFYRPG 
RWSDEQIREE HQFCLELAEH ELSVVAPWMN PAGDTLFHFD GFRFALYPRQ GGHAPEFDND
ENLAILGRML GRIHSIGAIR PFKERPTLES RSFGHDSVAL IKERFIPEEY RASYTAVTDQ
LLAAIDAAFA QTQGVTQIRA HGDCHAGNIL WRDGAPHFVD FDDARMAPAV QDLWMMLSGE
RPRQLVQLEQ LVKGYTEFRD FHPGELMLVE PLRALRMLHY SAWLARRWED PTFPITFPWF
NTVRYWGEHI LQLREQLSAL DEPPLELP