Gene GM21_3879 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3879 
Symbol 
ID8139253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4465057 
End bp4466463 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content66% 
IMG OID644871496 
Productselenocysteine synthase 
Protein accessionYP_003023654 
Protein GI253702465 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1921] Selenocysteine synthase [seryl-tRNASer selenium transferase] 
TIGRFAM ID[TIGR00474] seryl-tRNA(sec) selenium transferase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value0.797755 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGAGG GGGTGCTCGT GCAGCAGCTT AAAATGATCC CGAAGGTTGA CCGGGTCCTC 
GAGTGGGAGG CCGTGCGCAA GCTTCTGTCC ACCTACCCAA GGGAGCTGGT GCTGCGAGCC
GTCCGCAGCG TGCTGGATAG ACTGCGCTGC GGCGCCCGTG CCGGCGGTCT CGACGATTCC
GCCTTCCTTG AGCCCGCAGT CTGCGCCGCA GTGGCCCGTG AACTAGCCCA ACTCTCCCGG
CCGAGCCTGC AGAGGGTGAT CAACGGCTCC GGCATCGTGA TACACACAAA TCTGGGACGC
TCCATCCTCC CCGAGGCCGC CCGCGACGCC CTCAACACCA TAGCCTTCTC CTATTCGAAC
CTTGAGTTCG ACCTCGAAGC GGGGGCGCGC GGCAGCCGCT ACAGCCACGT GGAGGCGCTC
CTTTGCGAGC TGACCGGGGC CGAGGCCGCC ATCGTGGTCA ACAACAACGC CGCGGCGGTT
CTCTTGTCGC TGAGCTCGAT GGCCTCCGGG CGCGAGGCGG TGGTTTCGCG CGGTGAGCTG
GTGGAGATTG GGGGGTCGTT CCGCATTCCC GAAGTGATGC GACTTTCGGG CGTGACGCTG
AGGGAGGTGG GGACGACGAA CCGGACCCAT CCCAAGGACT ACAGCGGCGC GGTTAACGAG
CAGACCGCGG TGTTTCTCAA GGTGCACTGC AGCAACTTCG CGGTTCTCGG CTTCACCGCC
GAGGTGACCG CACAGGAGTT GGTGGCGCTT GGGGCCGCCG CCGGGGTCCC GGTACTGGCC
GACATGGGAA GCGGCAATCT GGTCGACCTT TCGGGGCGCC TGCCGGTTCC GGAACCGACG
GTTCAGGAGT TCGTGCGCGC GGGGGTCGAC GTCATCACCT TCAGCGGCGA CAAGCTCCTG
GGAGGCCCGC AGGCCGGGAT CATCGTGGGG AAAAAGCCGT TCATCGAGGC GATGAAAAAG
CACCAGCTGC TGCGCGCGCT CAGGATGGAC AAGCTCACCC TGGCGAGCCT GGAGGCGACG
CTCGCCCTGT ACCGGGACGA GATGGTCGCG CTCAAGGAAG TGCCGACACT GAGGATGCTG
ACCGCCACCC TCCCCGACTT GACGGCGCGG GCGAAAAAGA TCAGCGCATT TTTGCGCCGC
CGCACGCCGC AGGGGATCAG CTTCAAGCTG AACGAGGGGT TTTCCCAGGC AGGAGGCGGG
ACGCTGCCGC TGTTGAACCT CCCCAGCATG CTGATCGAGG TCGCGGTGGA GGGACTCTCC
CCCAACGACA TCGAGTCGCG GCTCAGGAAA TCCGAGATCC CGGTCATCGG CAGGATCAAC
AAGAACGCCT TCCTGCTCGA CCCCCGCACC CTGCTGGACG GCGACCTGCC GGACCTCGCC
GCGGCCATCT CCGGGCTGGC AGGGTAG
 
Protein sequence
MQEGVLVQQL KMIPKVDRVL EWEAVRKLLS TYPRELVLRA VRSVLDRLRC GARAGGLDDS 
AFLEPAVCAA VARELAQLSR PSLQRVINGS GIVIHTNLGR SILPEAARDA LNTIAFSYSN
LEFDLEAGAR GSRYSHVEAL LCELTGAEAA IVVNNNAAAV LLSLSSMASG REAVVSRGEL
VEIGGSFRIP EVMRLSGVTL REVGTTNRTH PKDYSGAVNE QTAVFLKVHC SNFAVLGFTA
EVTAQELVAL GAAAGVPVLA DMGSGNLVDL SGRLPVPEPT VQEFVRAGVD VITFSGDKLL
GGPQAGIIVG KKPFIEAMKK HQLLRALRMD KLTLASLEAT LALYRDEMVA LKEVPTLRML
TATLPDLTAR AKKISAFLRR RTPQGISFKL NEGFSQAGGG TLPLLNLPSM LIEVAVEGLS
PNDIESRLRK SEIPVIGRIN KNAFLLDPRT LLDGDLPDLA AAISGLAG