Gene GM21_1549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1549 
Symbol 
ID8136879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1808879 
End bp1809823 
Gene Length945 bp 
Protein Length314 aa 
Translation table11 
GC content63% 
IMG OID644869162 
ProductLAO/AO transport system ATPase 
Protein accessionYP_003021363 
Protein GI253700174 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1703] Putative periplasmic protein kinase ArgK and related GTPases of G3E family 
TIGRFAM ID[TIGR00750] LAO/AO transport system ATPase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.0000000360873 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCCTTG CCGAACGTGT GCTGGATGGC GAGATACGCG CCGCGGCGCG CCTGATGCGG 
GATATCGACG ACCGTTTCAA AAGTGCTGTC GACGAGTTGA AGATCCTCTA CCCCCATACA
GGAAAAGCCT ATATCATCGG CATCACCGGC CCGCCGGGCG CCGGCAAGTC GACCCTCGTC
GACCAGGTAG TGGCGGCCTA CCGCAAGAAG GACCTCCTGG TGGGCGTGGT TGCGATCGAC
CCGACCAGCC CCTTTTCCGG CGGCGCCATA CTGGGTGACC GCATCCGCAT GAACCGCCAT
GCCGACGACC CCGGCGTCTT CATCAGGAGT CTTGCCACCC GCGGCGCACT GGGAGGGCTT
TCACGCTCGA CCATGGACGT GCTCAACGTG ATGGACGCGA TGGGGCTTGA CGTCATCGTG
GTGGAAACGG TCGGGGTAGG GCAGGACGAG GTCGACATCG TCAGCACCGC GCACACCACG
GTGGTGGTCA TGGTCCCGGG TCTGGGAGAC GACATCCAGG CGATCAAGGC CGGCATCCTC
GAAATCGGGG ACGTCTTCGT GGTCAACAAG GCGGACCGTG ACGGCGCCGA GCGCACCGAG
CGCGAACTGA CCGCGATGCT CGAGATGAAA CACCCCGAAC CGGGCGATTG GCTGCCGCAC
GTGATCAAGA CCGAAGCGGC CAAAGGGCTC GGCATCGACG AACTTGTGGA TGAATTCGAG
GCGCACCACA GCTACCTCAA GGAGTCGGGT GCCCTGCAGC GCCTGATCCA GGAACGAAAC
GCCAAGATCT TCGCGGACAC GCTGCGCGAG GAACTCTTCG AGTCCGTCTT CAGCGGCATA
AAGGAAAGCG GGAAGTATCA GCAGATACTC GACGGCATGC GCGACAGGAG CACCGATCCC
TACAGCGCGG TTGAAGAGGT CATGGCTGCG CGCTCTTTTT CTTGA
 
Protein sequence
MSLAERVLDG EIRAAARLMR DIDDRFKSAV DELKILYPHT GKAYIIGITG PPGAGKSTLV 
DQVVAAYRKK DLLVGVVAID PTSPFSGGAI LGDRIRMNRH ADDPGVFIRS LATRGALGGL
SRSTMDVLNV MDAMGLDVIV VETVGVGQDE VDIVSTAHTT VVVMVPGLGD DIQAIKAGIL
EIGDVFVVNK ADRDGAERTE RELTAMLEMK HPEPGDWLPH VIKTEAAKGL GIDELVDEFE
AHHSYLKESG ALQRLIQERN AKIFADTLRE ELFESVFSGI KESGKYQQIL DGMRDRSTDP
YSAVEEVMAA RSFS