Gene Smed_4002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4002 
Symbol 
ID5319249 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp456248 
End bp457288 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content58% 
IMG OID640775810 
Productcobalamin synthesis protein P47K 
Protein accessionYP_001312743 
Protein GI150376147 
COG category[R] General function prediction only 
COG ID[COG0523] Putative GTPases (G3E family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.242709 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGACCA ACCATCTTCA GGACAGCAAA ATTCCCGTCA CCATCCTGAC CGGCTTTCTC 
GGTGCCGGGA AGACGACGCT TTTGAACCAC ATACTGACTG AACGGCACGG CCACCGCATC
GCCGTGATCG AGAACGAGTT CGGCGAGGTG GATGTCGACT CGGACCTGGT GCTCGCCTCG
GAGGAAGAGA TCTACCAGAT GAAGAACGGC TGCATCTGCT GCTTCGTCGA CGTGCGCAAC
GACCTGATCG AGGTCCTGCA GAAACTGCTT GCCCGAAAGG ACAAGTTCGA CCACATCCTC
GTCGAGACCA GCGGGCTGGC AGACCCGACC CCCGTTGCAA CAGCCTTCTT CATCGATGAT
GAAATCGGCA AGCATGTGAC GCTGGACGGC ATCGTGACCC TGGTCGACGC CAAGCATATC
GGACAGCATA TCGAGGATCC CGTTCTCGAT GGGCGCGACA ACCAGGCGGT CGATCAGATC
GTCGCCGCCG ACCGTATCAT CATCAATAAG ATCGACCTCG TATCGGATGG CGAGATCGCT
CCTCTGGAAC GCGACATGCG CAAGCTCAAC CAGACGGCCG AAATCGTACG CTCGAGCTAT
GGCAAGGTGG ACCTGTCGAG CATCCTCGGC ATTTCCGGTT TCGCGCCATC CTATGTTGCC
GAACGCGCCA AGCTGCTCGA TCTCGATCAC CACCACCACG GTCATCACCA CCACCATCAT
CATGATGCGA CGGTCAGCTC GGAATCCTTC GTCTTCGACC GGCCCTTCGA CCAGCATCGC
CTGACGGAAT ATCTCTCGGA CCTGCTTCGG GAAAAGGGCG ACGACATATT CCGTACCAAA
GGCATCATAG CGATCACCGG AGACCCTCGC TTCTTCGTCC TCCAGGCGGT GCACAAGCTG
ATGGATTTCC GTCCGGATCA TGTCTGGGGG AAGGATATGC CCTATTCGAA GCTGGTCTTC
ATCGGCCGCA ATCTCGACCG GGCGGTCCTG GAGGAAGGTC TGAAGCGCTG CCTTACCCCG
GCCGGCGAAA CGGTTTATTG A
 
Protein sequence
MQTNHLQDSK IPVTILTGFL GAGKTTLLNH ILTERHGHRI AVIENEFGEV DVDSDLVLAS 
EEEIYQMKNG CICCFVDVRN DLIEVLQKLL ARKDKFDHIL VETSGLADPT PVATAFFIDD
EIGKHVTLDG IVTLVDAKHI GQHIEDPVLD GRDNQAVDQI VAADRIIINK IDLVSDGEIA
PLERDMRKLN QTAEIVRSSY GKVDLSSILG ISGFAPSYVA ERAKLLDLDH HHHGHHHHHH
HDATVSSESF VFDRPFDQHR LTEYLSDLLR EKGDDIFRTK GIIAITGDPR FFVLQAVHKL
MDFRPDHVWG KDMPYSKLVF IGRNLDRAVL EEGLKRCLTP AGETVY