Gene Information |
Locus tag | Saro_1642 |
Symbol | |
ID | 3918751 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 1716319 |
End bp | 1718190 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640444383 |
Product | carbamoyl-phosphate synthase L chain, ATP-binding |
Protein accession | YP_496916 |
Protein GI | 87199659 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit |
TIGRFAM ID | [TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit |
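The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record, but both are consistent with its values. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1      # inclusive span on the replicon
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3                # includes the stop codon
    return gene_length, codons - 1           # the stop codon encodes no residue


# Values taken from the record for Saro_1642
print(cds_lengths(1716319, 1718190))  # → (1872, 623)
```

Under these assumptions, 1718190 - 1716319 + 1 gives 1872 bp, which is 624 codons, or 623 residues plus the stop codon.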
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.473054 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAAAT CCCTCCTCGT CGCCAATCGC GGCGAAATTG CCTGTCGCGT GATCCGTACA GCGCGGCGCA TGGGCGTTCG CACAGTGGCG GTCTATTCCG ATGCCGATGC CAATGCCCTG CACGTCCGCT CGGCCGACGA AGCAGTCCAC ATAGGTCCGG CAGCAGCGCG CGAAAGCTAT CTGGTGGGAG AGCGCATCAT TGCCGCCGCG CTCACCACTG GCGCTGAGGC CATCCACCCC GGCTATGGGT TCCTCTCGGA GAATGCCGAG TTCGCTCAGG CCGTTCTCGA CGCGGGCCTG GTCTGGGTTG GCCCGAAGCC GCACTCGATC ACCGCGATGG GCCTCAAGGA CGCCGCCAAG GCCCGCATGA TCGCGGCAGG CGTGCCGGTG ACGCCGGGAT ATCTGGGCGA GGATCAATCG GCGGAGCGAT TGCAGGCCGA GGCCGACGCC ATCGGCTACC CGGTACTGAT CAAGGCCGTC GCGGGCGGAG GCGGCAAGGG CATGCGGCGC GTCGATGCCG CGGCGGACTT TGCCGAAGCG CTCGCCTCGT GCCGCCGTGA GGCTGCCTCC TCGTTCGGCG ACGACCGTGT GCTTATCGAG AAGTACATCC TTTCCCCGCG GCACATTGAG GTTCAGGTCT TCGGCGACGC CCACGGCAAC GTCGTCCACC TGTTCGAACG CGACTGCTCG CTTCAGCGGC GGCACCAGAA GGTGATCGAG GAGGCCCCTG CGCCCGGCAT GGACGAAGCG ACCCGCGAGG CTGTCTGCGC GGCCGCCGTC CGCGCGGCCA AGGCGGTCGA CTATGAAGGC GCGGGCACCA TCGAATTCAT CGCCGATGGC TCCGAAGGCC TGCGCGCGGA CCGGATCTGG TTCATGGAAA TGAACACGCG CCTGCAGGTG GAACATCCGG TGACCGAGGA GATCACCGGC GTCGACCTCG TCGAATGGCA GCTCCGCGTC GCGTCGGGTG AGCCGCTGCC CAAGCGGCAA GACGAGCTAT CGATCAACGG CTGGGCGATG GAAGCCCGGC TCTACGCCGA GGACCCGACC CGGGGCTTCC TGCCCAGCAT CGGGCGCGTC GACGATTTCC ACTTCCCGCA CCATCATGCG CGCATTGATA CGGGGGTAGA GGCGGGCGCG GAAATCTCGC CCTTCTACGA TCCGATGATC GCCAAGCTCA TCGTCCACCG CCCGACCCGG ACTGAAGCCG TCTCTGCGCT GCGCGAGACA CTTGACGAAG GCATCGTCGG ACCGCTCGTC ACCAACAGCG GCTTCCTCTG GCGCCTGCTG GGCCACGCGG CGTTCGAGGC CGGGGTCGTC GACACCGGCC TGATCGAGCG CAATCTCGAA ACCCTCGCCA CCCGGCCCGA GCCTTCCCGC GAGGGTCTGG CTCTGGCCGC AATGCGCCTT GCCGGCACGC CGGGCGCGAC GCCATGGTCG AGCCGGTCCG GCTTCCGCAT GAACGCTGCC CCGCGCCGCG ACGTTCGTCT TTCCGACCAG TTCGGCCGGA CGTTCACGAC CGAACTGCCA CCCGAACCGG CGTTCGACTA CTGGCCCGGC GAAGACGCGA CGACGATCGA CGAAGGTGGC GAGCGCTTTC GCGTGCGCCT GGCGCGGGCC GACGGGGGCT CCGGCGGCGC AGCCTCGGAT GGGGCCATCC TCGCCCCCAT GCCCGGCAAG GTCATCTCGG TCGATGTATC CGCAGGCCAG TCGGTGACCA AGGGCCAGAA ACTCATGGTG CTCGAGGCAA TGAAGATGGA ACATGCCCTC ACCGCCCCCT TCGACGGTGT CGTGGCCGAA 
CTCAACGCCG CGCCGGGCGG ACAGGTTCAG GTCGAGGCAC TGCTGGCGAA GATCGAAAAG GGAGAAGCCT GA
|
Protein sequence | MIKSLLVANR GEIACRVIRT ARRMGVRTVA VYSDADANAL HVRSADEAVH IGPAAARESY LVGERIIAAA LTTGAEAIHP GYGFLSENAE FAQAVLDAGL VWVGPKPHSI TAMGLKDAAK ARMIAAGVPV TPGYLGEDQS AERLQAEADA IGYPVLIKAV AGGGGKGMRR VDAAADFAEA LASCRREAAS SFGDDRVLIE KYILSPRHIE VQVFGDAHGN VVHLFERDCS LQRRHQKVIE EAPAPGMDEA TREAVCAAAV RAAKAVDYEG AGTIEFIADG SEGLRADRIW FMEMNTRLQV EHPVTEEITG VDLVEWQLRV ASGEPLPKRQ DELSINGWAM EARLYAEDPT RGFLPSIGRV DDFHFPHHHA RIDTGVEAGA EISPFYDPMI AKLIVHRPTR TEAVSALRET LDEGIVGPLV TNSGFLWRLL GHAAFEAGVV DTGLIERNLE TLATRPEPSR EGLALAAMRL AGTPGATPWS SRSGFRMNAA PRRDVRLSDQ FGRTFTTELP PEPAFDYWPG EDATTIDEGG ERFRVRLARA DGGSGGAASD GAILAPMPGK VISVDVSAGQ SVTKGQKLMV LEAMKMEHAL TAPFDGVVAE LNAAPGGQVQ VEALLAKIEK GEA
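As a rough consistency check between the gene and protein sequences, the following sketch translates a nucleotide string and computes its GC fraction. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA, although the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares for internal codons. The helper names `clean`, `translate`, and `gc_fraction` are illustrative, not part of any database tool.

```python
from itertools import product

# Codon assignments in TCAG order (standard code; table 11 uses the
# same assignments and differs only in its permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding-orientation sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base groups of the gene sequence from the record
print(translate("ATGATCAAAT CCCTCCTCGT CGCCAATCGC"))  # → MIKSLLVANR
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence if the assumptions hold, and `gc_fraction` on the same input can be compared against the reported GC content.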