Gene GM21_3526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3526 
Symbol 
ID8138898 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4068598 
End bp4069629 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content67% 
IMG OID644871145 
Productselenide, water dikinase 
Protein accessionYP_003023305 
Protein GI253702116 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0709] Selenophosphate synthase 
TIGRFAM ID[TIGR00476] selenium donor protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones79 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGACA AGGTACGACT CACAACGATG GTGCAGGCGG CGGGTTGAGC TGCCAAGCTG 
GGCCCGGCGG GCCTGGAAGA AGCCATTCAC GACATAACGC GCTCGGATGA CCCCAACCTC
ATCGTGGGGG TGGAAGGGGC GGAGGACGCC GGCATCTACC GGATCGGGGA TAACCTCGCC
CTGGTGGAGA CGACGGATAT CATCACGCCG CTGGTGGACG ACCCCTTCAC CTTCGGCCGC
ATCGCCGCGG CTAACGCCCT CTCGGACGTC TACGCCATGG GGGGGAGACC GGTGACGGCG
ATGAACTTGG CCTTCTTCCC CGCCTGCTCG CTCCCGACAA AGGTCCTCGC CGCCATCCTG
GCTGGGGGCT CGGACGCGCT CAAGGAGGCG GGGGCCTGCC TAGTCGGCGG GCACACGGTG
GAGGACAACG AGCTCAAGTT CGGGCTCGCG GTGACCGGCC TCATCGACCC GGCTCGCGTG
GTCAGAAACT GCACCGCCCG ACCGGGGGAC CTCATCGTCA TCACCAAGCC TCTTGGAACC
GGCATCGTCT CCACGGCCAT CAAGGCGGAG ATGGTCGAAC CGGTGCTGGA GGCGGAGGCA
ACCCGCTGGA TGACCATCCT CAACGCGCAG GCTGCGGAAC TGATGGTCGC CTGCCGCGCC
ACGGCCGCCA CGGACGTGAC CGGATTCGGC TTCATCGGCC ATGCCTGCGA GATGGCTCTC
GGGGCGAAGG TCACCTTCAG GATCGAACTT GCCCGGGTGC CGGTCATTCC GGGGGTCCCG
GCGCTGATCG ACGACGGCCT CGTCCCCGCC GGCTGCTACC GAAACCGCCA GCACTATGAA
CAACACGTCT CCGGAAAGAG CGGCGACCCC CTCTTGCCGC TCTTCGACCC CCAGACCTCG
GGGGGGCTGT TGATCACCTT CGCTCCCGAC GACGCCCGCA CTTTCCTCTC CCGCGCCGGG
GAGGAAGGGC TTTTCGCCGC CTGCATCGGC GAGGTCGAGC CCGCCGGAGG GACCCCTCTT
GTCTTCGTCT AG
 
Protein sequence
MTDKVRLTTM VQAAGUAAKL GPAGLEEAIH DITRSDDPNL IVGVEGAEDA GIYRIGDNLA 
LVETTDIITP LVDDPFTFGR IAAANALSDV YAMGGRPVTA MNLAFFPACS LPTKVLAAIL
AGGSDALKEA GACLVGGHTV EDNELKFGLA VTGLIDPARV VRNCTARPGD LIVITKPLGT
GIVSTAIKAE MVEPVLEAEA TRWMTILNAQ AAELMVACRA TAATDVTGFG FIGHACEMAL
GAKVTFRIEL ARVPVIPGVP ALIDDGLVPA GCYRNRQHYE QHVSGKSGDP LLPLFDPQTS
GGLLITFAPD DARTFLSRAG EEGLFAACIG EVEPAGGTPL VFV