Gene Saro_2231 details

Gene Information

Locus tag: Saro_2231
Symbol:
ID: 3916547
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2373388
End bp: 2374563
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 65%
IMG OID: 640444986
Product: carbamoyl phosphate synthase small subunit
Protein accession: YP_497503
Protein GI: 87200246
COG category: [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism
COG ID: [COG0505] Carbamoylphosphate synthase small subunit
TIGRFAM ID: [TIGR01368] carbamoyl-phosphate synthase, small subunit
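
The start, end, gene length, and protein length fields are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; both are assumptions not stated in the record, and the function names are illustrative.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1


length = gene_length_bp(2373388, 2374563)
print(length, protein_length_aa(length))  # 1176 391
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields above.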


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.961008
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a


Sequence

Gene sequence
ATGGCCGACC CCGCACTTTC GCACGCGCCT CAACCAGATG GCGCGACGGG AGTCCTTGTA 
CTTGCCGATG GGTCTGTTGC CTGGGGACGT GGATTTGGCG CTGCTGGTGC GGCCGTTGGC
GAGGTGTGCT TCAACACCGC GATGACCGGA TACCAGGAAG TGATGACCGA TCCGTCCTAT
GCGGGACAGG TCGTCACCTT CACTTTTCCG CACATCGGCA ACGTCGGCAC CAATCCGGAC
GACATGGAAA GCCATGACAC GCCGGGCGCC GTGGGCTGCG TGGTGCGCGA GGACGTGACC
GCGCCCGCCA ACTTCCGCGC TGCCGGCACC TTCCAGCAGT GGATGGAGCA GCAGGGCAAG
ATCGGCATCT CCGGCCTCGA CACCCGCGCG CTGACGCGCC GTATCCGCCT TTCAGGCGCG
CCGAACGCGG TGATCGCGCA CGATCCGAAG GGCCAGTTCG ACATTCCCGC GCTGATCGAA
CGCGCGCGGT CCTGGCCCGG TCTCGAAGGC ATGGACCTTG CCCGCGTCGT TACACGCGAC
GGGCAGGAAG CCTGGGAAGG CTCGGTCTGG CATCTGGGCG AGGGCTTCAC CAAGCCCCAA
GGCACTGGCC GTCCGCACGT CGTCGCCATG GATTTCGGCG CGAAGGACAA CATTTTCCGC
AATCTGGTGA AGGCCGGCGC GGACGTGACG ATCGTGCCCG CCGAAACCAG CCTCGAGACG
ATCCTTTCGC TCAAGCCCGC AGGCGTGTTC CTGTCGAACG GTCCGGGCGA CCCGGCGGCG
ACCGGCGCCT ACGCCGTGCC GGTGATCCAG AAGCTGCTCG AGATGGACAT GCCGGTTTTC
GGCATCTGCC TCGGCCACCA GATGCTGGCG CTCGCCGCCG GTGCACGCAC CACCAAGATG
CACCAGGGCC ACCGCGGTGC GAACCACCCG GTCAAGCGCA TCGAGGACGG CGTCGTCGAG
ATCACCTCGA TGAACCACGG CTTTGCTGTC GACAACGCGC CGGGCGCGCT GGGCGACAAG
GTGATCGAAA CTCACGTCTC GCTGTTCGAC GGCTCGAACT GCGGTATCGC GGTCAAGGGC
AAGAAGGCGT TCGGCGTGCA ATACCACCCG GAAGCTTCGC CCGGCCCGCA GGACAGTTTC
TACCTGTTCG AGAAGTTTGT AGCGTCGCTC GTTTGA
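
The gene sequence is displayed in blocks of ten bases, so it needs to be cleaned before any computation such as GC content. A minimal sketch follows; the helper names are hypothetical, and only the first three blocks of the sequence are used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not sequence:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First three 10-base blocks of the gene sequence, as displayed.
prefix = clean_sequence("ATGGCCGACC CCGCACTTTC GCACGCGCCT")
print(len(prefix), round(gc_fraction(prefix), 2))
```

Passing the full cleaned gene sequence to `gc_fraction` gives a value to compare against the GC content field.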
 
Protein sequence
MADPALSHAP QPDGATGVLV LADGSVAWGR GFGAAGAAVG EVCFNTAMTG YQEVMTDPSY 
AGQVVTFTFP HIGNVGTNPD DMESHDTPGA VGCVVREDVT APANFRAAGT FQQWMEQQGK
IGISGLDTRA LTRRIRLSGA PNAVIAHDPK GQFDIPALIE RARSWPGLEG MDLARVVTRD
GQEAWEGSVW HLGEGFTKPQ GTGRPHVVAM DFGAKDNIFR NLVKAGADVT IVPAETSLET
ILSLKPAGVF LSNGPGDPAA TGAYAVPVIQ KLLEMDMPVF GICLGHQMLA LAAGARTTKM
HQGHRGANHP VKRIEDGVVE ITSMNHGFAV DNAPGALGDK VIETHVSLFD GSNCGIAVKG
KKAFGVQYHP EASPGPQDSF YLFEKFVASL V
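
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below uses the standard codon assignments, which translation table 11 shares for codons after the start; it does not handle the alternative start codons that table 11 permits, which is not an issue here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in NCBI order (T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence from its first base up to the first stop codon."""
    sequence = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(sequence) - 2, 3):
        amino_acid = CODON_TABLE[sequence[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence.
print(translate("ATGGCCGACC CCGCACTTTC GCACGCGCCT"))  # MADPALSHAP
```

The ten residues produced match the first block of the protein sequence above.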