Gene Smed_1760 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1760 
Symbol 
ID5322618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1842139 
End bp1843203 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content61% 
IMG OID640790698 
Productcobalamin biosynthesis protein CobW 
Protein accessionYP_001327430 
Protein GI150396963 
COG category[R] General function prediction only 
COG ID[COG0523] Putative GTPases (G3E family) 
TIGRFAM ID[TIGR02475] cobalamin biosynthesis protein CobW 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0805096 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0884735 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACTCG CGAAGTCTCA GCAGGGCAAG ATTCCCGCTA CCGTCATCAC CGGGTTTCTC 
GGTGCCGGCA AGACGACGAT GATCCGCAAT CTGCTGCAGA ATGCCGACGG CAAGCGCATC
GCGCTGATCA TCAACGAGTT CGGCGACCTA GGCGTCGACG GCGACGTGTT GAAGGGTTGC
GGAGCTGAAG CCTGCACGGA GGATGATATC ATCGAGCTGA CCAATGGCTG CATTTGCTGC
ACTGTTGCCG ACGACTTCAT TCCGACCATG ACGAAGCTGC TCGAAAGAGA GAACCGTCCG
GACCATATCA TCATCGAGAC CTCGGGCCTG GCGCTGCCGC AGCCGCTCGT CGCGGCATTC
AACTGGCCGG ACATCCGCAG CGAGGTAACG GTCGACGGCG TCGTTACCGT CGTCGATAGC
GCCGCCGTTG CCGCGGGCCG CTTCGCAGAC GATCATGACA AGGTGGACGC ACTCCGCGCA
GGCGATGAAA ATCTCGATCA TGAGAGCCCG CTCGAAGAAC TCTTCGAGGA CCAGCTCACA
GCTGCCGACC TCATCGTTCT CAACAAGACG GATCTCATCG ATGCCGCGGG GTTGAAGTCG
GTGCGCGATG AAGTGGCCTC ACGCATCAAC CGCAAGCCCA CCATGATCGA GGCGAAGAAC
GGTGAGGTAG CAGCTGCCAT CCTGCTCGGG CTCGGGGTGG GTACGGAGGG CGACATCGTC
AACCGCAAGT CTCACCACGA GATGGAGCAC GAGGCGGGCG AGGAGCATGA TCACGACGAA
TTCGACAGCT TCGTCGTCGA ACTGGGTGCC ATAGCCGATC CTGCCGTTTT CGTGGAACGG
CTCAGAAATG TGATCGCACA GCACGACGTG CTGCGCCTCA AGGGTTTCGT CGACGTTCCC
GGCAAATCGA TGCGCCTCCT GATACAGGCG GTGGGCAGCC GCATCGACCA GTATTTCGAT
CGCGCATGGG CTCCGGGCGA AACGCGCAGC ACACGGCTGG TCGTCATAGG CCTGCATGAC
ATGGATGAGC CTGCCCTGCG GGCGGCAATA TCGGCACTTG TGTAA
 
Protein sequence
MTLAKSQQGK IPATVITGFL GAGKTTMIRN LLQNADGKRI ALIINEFGDL GVDGDVLKGC 
GAEACTEDDI IELTNGCICC TVADDFIPTM TKLLERENRP DHIIIETSGL ALPQPLVAAF
NWPDIRSEVT VDGVVTVVDS AAVAAGRFAD DHDKVDALRA GDENLDHESP LEELFEDQLT
AADLIVLNKT DLIDAAGLKS VRDEVASRIN RKPTMIEAKN GEVAAAILLG LGVGTEGDIV
NRKSHHEMEH EAGEEHDHDE FDSFVVELGA IADPAVFVER LRNVIAQHDV LRLKGFVDVP
GKSMRLLIQA VGSRIDQYFD RAWAPGETRS TRLVVIGLHD MDEPALRAAI SALV