Gene Sare_0031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0031 
Symbol 
ID5707332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp36598 
End bp37935 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content70% 
IMG OID641269556 
Productcarbamoyl-phosphate synthase L chain ATP-binding 
Protein accessionYP_001534958 
Protein GI159035705 
COG category[I] Lipid transport and metabolism 
COG ID[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.508002 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGAGT CGCTGCTGGT CGTCAATCGG GGTGAGATCG CCCGCCGGAT CATCCGTACC 
GCGAAGCGGC TCGGTGTCCG GGCGGTCGCG GTGCACTCAG AGGCGGACGC CGGCCTACCG
TTTGTGGCCG AGGCCGATGA GGCCGTCTGC GTCGGCCCGG CGAACCCGGC AGAGAGCTAC
CGGAACGTCG AGGCCGTCCT CGCCGCCGCC AAGTCCACCG GGGCGCAGGC GATCCACCCC
GGCTATGGCT TCCTGTCGGA GAACGCCGAC TTCGCCCAGG CGGTCGAGGC CAGTGGCCTG
ATCTGGGTTG GGCCCGGCGC TGATGCGATC ACCGCGATGG GCGACAAGAT CAACGCCCGG
AACCTGATGG CGGCGGCCGG GGTGCCGGTC GCGCCGGGGA CCACGGAACC GGCGGCTGAC
CTCGCCGCAG CGGTCGATGC GGCGGCGGGG ATCGGCTACC CGGTGATGGT CAAGGCCGCC
GCCGGCGGGG GCGGCATGGG CATGGGGATC GCGAACGACG AGGCCGCGCT ACGCACCGAA
TTCGACAAGG TGCGGTCGTT CGCCGAGCGG ATGTTCGGGG ACGGTTCGGT GCTGATCGAA
CGGTACTTCC CTCGGGTACG GCACGTCGAG GTGCAGATCC TCGGCCTGGC CGACGGCCGG
GTGGTGGCGC TCGGTGAACG CGAGTGCTCG GTGCAGCGAC GTAACCAGAA GCTGGTGGAG
GAGTCACCGT CCCCAGCCGT CACTCCCGAG CTCCGGTCCC GTTTCCTGGC TGCGGCGGTG
CGGGCCGGCG AGGCGGTCGG CTACCGGAAC GCCGGCACGG TCGAGTGTCT GCTCGACCCC
ACCACGGATG AGTTCTTCTT CCTCGAGATG AACACGCGGC TGCAGGTTGA GCACCCGGTC
ACCGAGTCGG TCTACGGGGT CGACCTGGTG GAGGAGCAGC TGCGGGTGGC CGCCGGGCTG
CCGCCGACCT TTGACCCGGA CGCCGTCACG CCCCGCGGGC ACGCGATCGA GCTGCGGGTC
AACGCCGAGG ACCCGAAGCG TTTCCTGCCC GGTCCGGGTG CGATCACCGT CTGGACCGAA
CCGGCCGGCG AGGGCGTCCG CGTCGATGCT GGGTACGTCG CCGGCAACAC GGTGACCCCG
TTCTACGACA GCCTGATGGC CAAGCTCATC GTCAGTGGTG CGGATCGCGC GGAGGCGATC
AGCCGTGCGC GGGCCGCGGT GGCGCAGTTC CAGCTCGTCG GCCCGAAGAA CAACCTTCCC
TTCTTCGCCG AGCTGCTGGA CAACGCCGAG TTCCTCTCCG GCGACTACGA CACCGGCATC
GTCTCCCGGA TGCGTTGA
 
Protein sequence
MIESLLVVNR GEIARRIIRT AKRLGVRAVA VHSEADAGLP FVAEADEAVC VGPANPAESY 
RNVEAVLAAA KSTGAQAIHP GYGFLSENAD FAQAVEASGL IWVGPGADAI TAMGDKINAR
NLMAAAGVPV APGTTEPAAD LAAAVDAAAG IGYPVMVKAA AGGGGMGMGI ANDEAALRTE
FDKVRSFAER MFGDGSVLIE RYFPRVRHVE VQILGLADGR VVALGERECS VQRRNQKLVE
ESPSPAVTPE LRSRFLAAAV RAGEAVGYRN AGTVECLLDP TTDEFFFLEM NTRLQVEHPV
TESVYGVDLV EEQLRVAAGL PPTFDPDAVT PRGHAIELRV NAEDPKRFLP GPGAITVWTE
PAGEGVRVDA GYVAGNTVTP FYDSLMAKLI VSGADRAEAI SRARAAVAQF QLVGPKNNLP
FFAELLDNAE FLSGDYDTGI VSRMR