Gene EcSMS35_0032 details


Gene Information

Locus tag: EcSMS35_0032
Symbol: carA
ID: 6144305
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 34771
End bp: 35946
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 53%
IMG OID: 641614933
Product: carbamoyl phosphate synthase small subunit
Protein accession: YP_001742149
Protein GI: 170683312
COG category: [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism
COG ID: [COG0505] Carbamoylphosphate synthase small subunit
TIGRFAM ID: [TIGR01368] carbamoyl-phosphate synthase, small subunit
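
The length fields in this record are consistent with the listed coordinates. The short sketch below illustrates the arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 34771
end_bp = 35946

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")     # 1176 bp
print(protein_length_aa, "aa")  # 391 aa
```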


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTGAAT ATTCTCTGGA GGGTGTTTTG ATTAAGTCAG CGCTATTGGT TCTGGAAGAC 
GGAACCCAGT TTCACGGTCG GGCCATAGGG GCAACAGGTT CGGCGGTTGG GGAAGTCGTT
TTCAATACTT CAATGACCGG TTATCAAGAA ATCCTCACTG ATCCTTCCTA TTCTCGTCAA
ATCGTTACTC TTACTTATCC CCATATTGGC AATGTCGGCA CTAATGACGC CGATGAAGAA
TCTTCTCAGG TACATGCACA AGGTCTGGTG ATTCGCGACC TGCCGCTGAT TGCCAGCAAC
TTCCGTAATA CCGAAGACCT CTCTTCTTAC CTGAAGCGCC ATAACATCGT GGCGATTGCC
GATATCGATA CCCGTAAGCT GACGCGTTTA CTGCGCGAGA AAGGCGCACA GAATGGCTGC
ATTATCGCGG GCGATAACCC GGATGCGGCG CTGGCGTTAG AAAAAGCCCG CGCGTTCCCA
GGTCTGAACG GCATGGATCT GGCAAAAGAA GTGACCACCG CAGAAACGTA TAGCTGGACA
CAAGGGAGCT GGACGCTGAC CGGCGGCCTG CCAGAAGCGA AGAAAGAAGA CGAGCTGCCG
TTCCATGTTG TGGCTTATGA TTTTGGTGCC AAGCGCAACA TCCTGCGCAT GTTGGTGGAC
AGAGGCTGTC GTCTGACTAT CGTTCCGGCG CAAACTTCTG CGGAAGATGT GTTGAAAATG
AATCCAGACG GCATCTTCCT CTCCAACGGC CCTGGCGACC CGGCCCCATG CGATTACGCC
ATTACCGCCA TCCAGAAATT CCTCGAAACC GATATTCCGG TATTCGGCAT CTGCCTCGGT
CATCAGCTGC TGGCGCTGGC GAGCGGTGCG AAGACTGTCA AAATGAAATT TGGTCACCAC
GGCGGCAACC ATCCGGTTAA AGATGTTGAG AAAAACGTGG TGATGATCAC CGCCCAGAAC
CACGGTTTTG CGGTGGATGA AGCAACATTA CCTGCAAACC TGCGTGTCAC GCATAAATCC
CTGTTCGACG GTACGTTACA GGGCATTCAT CGCACCGATA AACCGGCGTT CAGCTTCCAG
GGTCACCCGG AAGCCAGCCC TGGTCCACAC GACGCCGCGC CGTTGTTCGA CCACTTTATC
GAGTTAATTG AGCAGTACCG TAAAACCGCT AAGTAA
 
Protein sequence
MSEYSLEGVL IKSALLVLED GTQFHGRAIG ATGSAVGEVV FNTSMTGYQE ILTDPSYSRQ 
IVTLTYPHIG NVGTNDADEE SSQVHAQGLV IRDLPLIASN FRNTEDLSSY LKRHNIVAIA
DIDTRKLTRL LREKGAQNGC IIAGDNPDAA LALEKARAFP GLNGMDLAKE VTTAETYSWT
QGSWTLTGGL PEAKKEDELP FHVVAYDFGA KRNILRMLVD RGCRLTIVPA QTSAEDVLKM
NPDGIFLSNG PGDPAPCDYA ITAIQKFLET DIPVFGICLG HQLLALASGA KTVKMKFGHH
GGNHPVKDVE KNVVMITAQN HGFAVDEATL PANLRVTHKS LFDGTLQGIH RTDKPAFSFQ
GHPEASPGPH DAAPLFDHFI ELIEQYRKTA K
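
The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not the database's own method: it strips the 10-character grouping, computes GC content, and translates with the codon assignments of translation table 11. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the start-codon set is an assumption based on the published table 11 initiation codons. A start codon in the first position, such as the GTG here, is read as M.

```python
# Codon assignments in TCAG order; table 11 shares these with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(block):
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        # An alternative start codon in the first position is read as Met.
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_line = "GTGAGTGAAT ATTCTCTGGA GGGTGTTTTG ATTAAGTCAG CGCTATTGGT TCTGGAAGAC"
last_line = "GAGTTAATTG AGCAGTACCG TAAAACCGCT AAGTAA"
print(translate(first_line))  # MSEYSLEGVLIKSALLVLED
print(translate(last_line))   # ELIEQYRKTAK
```

The two printed strings correspond to the first 20 and last 11 residues of the protein sequence above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field of the record.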