Gene Smed_2229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2229 
Symbol 
ID5323090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2309639 
End bp2310844 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content64% 
IMG OID640791167 
Productcarbamoyl phosphate synthase small subunit 
Protein accessionYP_001327896 
Protein GI150397429 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0505] Carbamoylphosphate synthase small subunit 
TIGRFAM ID[TIGR01368] carbamoyl-phosphate synthase, small subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCGA CACCCGCATG GACAATCCAG AAACCCACTG CCCTGCTCGT TCTGGCCGAC 
GGCACGGTGA TCGAAGGCAA GGGCATCGGC GCGACCGGAA AGGTTCAGGC CGAGGTCTGT
TTCAACACGG CGCTGACCGG ATACCAGGAA ATCCTGACCG ATCCCTCCTA TCTCGGTCAG
ATCGTCACCT TCACCTTCCC GCATATCGGC AATATCGGCG CCAATGACGA GGACATCGAG
GATCTGACAC CCGCCGCGCG CCACGGTGCC GTCGGCGTGA TCTTCAAGGC CGACATCACG
GAGCCCTCCA ACTACCGCGC CGCCAGGCAT CTCGATACTT GGCTGAAGGC GCGCGGCATC
ATCGGCCTCT GCGGCATCGA CACGCGTGCG CTGACCGCCT GGATCCGCGA AAACGGCATG
CCGAACGCCG TCATCGCACA CGATCCGGCG GGCGTCTTCG ATGTCGGGGC GTTAAAGGCC
GAGGCGAAGG CGTGGAGCGG TCTCGAAGGC CTCGACCTCG CCAAGGTCGC CACTTCCGGC
CAGTCCTATC GCTGGGATGA GAAGCCCTGG ATCTGGGACG AGGGCTATTC GACGCTCGGC
GAAACCGATG CCGCCTATCA TGTCGTCGCC CTCGACTACG GCGTCAAGCG GAACATTCTC
CGCCTCTTCG CCGGGCTAAA TTGCCGTGTC ACCGTCGTCC CGGCTCAGAC GAGCGCCGAG
GAAGTTCTGG CGCTAAGGCC CGATGGCATC TTCCTGTCGA ACGGCCCGGG CGACCCGGCC
GCAACCGGCG AATATGCCGT GCCGGTCATC AAGGATCTCC TCAAGACGGA TATCCCGGTC
TTCGGCATAT GCCTGGGCCA CCAGATGCTG GCGCTGGCGC TGGGTGCCAG GACCGAGAAG
ATGCACCAGG GCCACCACGG CGCCAACCAC CCGGTCAAGG ACCACACCAC CGGCAAGGTC
GAGATCGTTT CGATGAATCA CGGCTTCGCA GTCGATGCGA ACTCGCTCCC GCAAGGGGTT
GAACAGACTC ACATCTCGCT GTTCGACGGC ACCAATTGCG GCCTGCGCGT CGACGGCAGG
CCGGTCTTCT CGGTCCAGCA CCACCCGGAA GCTTCGCCGG GCCCGCAGGA CAGCCATTAC
CTCTTCCGCC GCTTCCTGAA CCTCATTCGT GAGAAGAAAG GCGAACCGGC ACTCGCCGAG
CGCTGA
 
Protein sequence
MTATPAWTIQ KPTALLVLAD GTVIEGKGIG ATGKVQAEVC FNTALTGYQE ILTDPSYLGQ 
IVTFTFPHIG NIGANDEDIE DLTPAARHGA VGVIFKADIT EPSNYRAARH LDTWLKARGI
IGLCGIDTRA LTAWIRENGM PNAVIAHDPA GVFDVGALKA EAKAWSGLEG LDLAKVATSG
QSYRWDEKPW IWDEGYSTLG ETDAAYHVVA LDYGVKRNIL RLFAGLNCRV TVVPAQTSAE
EVLALRPDGI FLSNGPGDPA ATGEYAVPVI KDLLKTDIPV FGICLGHQML ALALGARTEK
MHQGHHGANH PVKDHTTGKV EIVSMNHGFA VDANSLPQGV EQTHISLFDG TNCGLRVDGR
PVFSVQHHPE ASPGPQDSHY LFRRFLNLIR EKKGEPALAE R