Gene Saro_1642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1642 
Symbol 
ID3918751 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1716319 
End bp1718190 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content67% 
IMG OID640444383 
Productcarbamoyl-phosphate synthase L chain, ATP-binding 
Protein accessionYP_496916 
Protein GI87199659 
COG category[I] Lipid transport and metabolism 
COG ID[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.473054 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCAAAT CCCTCCTCGT CGCCAATCGC GGCGAAATTG CCTGTCGCGT GATCCGTACA 
GCGCGGCGCA TGGGCGTTCG CACAGTGGCG GTCTATTCCG ATGCCGATGC CAATGCCCTG
CACGTCCGCT CGGCCGACGA AGCAGTCCAC ATAGGTCCGG CAGCAGCGCG CGAAAGCTAT
CTGGTGGGAG AGCGCATCAT TGCCGCCGCG CTCACCACTG GCGCTGAGGC CATCCACCCC
GGCTATGGGT TCCTCTCGGA GAATGCCGAG TTCGCTCAGG CCGTTCTCGA CGCGGGCCTG
GTCTGGGTTG GCCCGAAGCC GCACTCGATC ACCGCGATGG GCCTCAAGGA CGCCGCCAAG
GCCCGCATGA TCGCGGCAGG CGTGCCGGTG ACGCCGGGAT ATCTGGGCGA GGATCAATCG
GCGGAGCGAT TGCAGGCCGA GGCCGACGCC ATCGGCTACC CGGTACTGAT CAAGGCCGTC
GCGGGCGGAG GCGGCAAGGG CATGCGGCGC GTCGATGCCG CGGCGGACTT TGCCGAAGCG
CTCGCCTCGT GCCGCCGTGA GGCTGCCTCC TCGTTCGGCG ACGACCGTGT GCTTATCGAG
AAGTACATCC TTTCCCCGCG GCACATTGAG GTTCAGGTCT TCGGCGACGC CCACGGCAAC
GTCGTCCACC TGTTCGAACG CGACTGCTCG CTTCAGCGGC GGCACCAGAA GGTGATCGAG
GAGGCCCCTG CGCCCGGCAT GGACGAAGCG ACCCGCGAGG CTGTCTGCGC GGCCGCCGTC
CGCGCGGCCA AGGCGGTCGA CTATGAAGGC GCGGGCACCA TCGAATTCAT CGCCGATGGC
TCCGAAGGCC TGCGCGCGGA CCGGATCTGG TTCATGGAAA TGAACACGCG CCTGCAGGTG
GAACATCCGG TGACCGAGGA GATCACCGGC GTCGACCTCG TCGAATGGCA GCTCCGCGTC
GCGTCGGGTG AGCCGCTGCC CAAGCGGCAA GACGAGCTAT CGATCAACGG CTGGGCGATG
GAAGCCCGGC TCTACGCCGA GGACCCGACC CGGGGCTTCC TGCCCAGCAT CGGGCGCGTC
GACGATTTCC ACTTCCCGCA CCATCATGCG CGCATTGATA CGGGGGTAGA GGCGGGCGCG
GAAATCTCGC CCTTCTACGA TCCGATGATC GCCAAGCTCA TCGTCCACCG CCCGACCCGG
ACTGAAGCCG TCTCTGCGCT GCGCGAGACA CTTGACGAAG GCATCGTCGG ACCGCTCGTC
ACCAACAGCG GCTTCCTCTG GCGCCTGCTG GGCCACGCGG CGTTCGAGGC CGGGGTCGTC
GACACCGGCC TGATCGAGCG CAATCTCGAA ACCCTCGCCA CCCGGCCCGA GCCTTCCCGC
GAGGGTCTGG CTCTGGCCGC AATGCGCCTT GCCGGCACGC CGGGCGCGAC GCCATGGTCG
AGCCGGTCCG GCTTCCGCAT GAACGCTGCC CCGCGCCGCG ACGTTCGTCT TTCCGACCAG
TTCGGCCGGA CGTTCACGAC CGAACTGCCA CCCGAACCGG CGTTCGACTA CTGGCCCGGC
GAAGACGCGA CGACGATCGA CGAAGGTGGC GAGCGCTTTC GCGTGCGCCT GGCGCGGGCC
GACGGGGGCT CCGGCGGCGC AGCCTCGGAT GGGGCCATCC TCGCCCCCAT GCCCGGCAAG
GTCATCTCGG TCGATGTATC CGCAGGCCAG TCGGTGACCA AGGGCCAGAA ACTCATGGTG
CTCGAGGCAA TGAAGATGGA ACATGCCCTC ACCGCCCCCT TCGACGGTGT CGTGGCCGAA
CTCAACGCCG CGCCGGGCGG ACAGGTTCAG GTCGAGGCAC TGCTGGCGAA GATCGAAAAG
GGAGAAGCCT GA
 
Protein sequence
MIKSLLVANR GEIACRVIRT ARRMGVRTVA VYSDADANAL HVRSADEAVH IGPAAARESY 
LVGERIIAAA LTTGAEAIHP GYGFLSENAE FAQAVLDAGL VWVGPKPHSI TAMGLKDAAK
ARMIAAGVPV TPGYLGEDQS AERLQAEADA IGYPVLIKAV AGGGGKGMRR VDAAADFAEA
LASCRREAAS SFGDDRVLIE KYILSPRHIE VQVFGDAHGN VVHLFERDCS LQRRHQKVIE
EAPAPGMDEA TREAVCAAAV RAAKAVDYEG AGTIEFIADG SEGLRADRIW FMEMNTRLQV
EHPVTEEITG VDLVEWQLRV ASGEPLPKRQ DELSINGWAM EARLYAEDPT RGFLPSIGRV
DDFHFPHHHA RIDTGVEAGA EISPFYDPMI AKLIVHRPTR TEAVSALRET LDEGIVGPLV
TNSGFLWRLL GHAAFEAGVV DTGLIERNLE TLATRPEPSR EGLALAAMRL AGTPGATPWS
SRSGFRMNAA PRRDVRLSDQ FGRTFTTELP PEPAFDYWPG EDATTIDEGG ERFRVRLARA
DGGSGGAASD GAILAPMPGK VISVDVSAGQ SVTKGQKLMV LEAMKMEHAL TAPFDGVVAE
LNAAPGGQVQ VEALLAKIEK GEA