Gene Information |
Locus tag | Saro_2515 |
Symbol | |
ID | 3916836 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2719619 |
End bp | 2720899 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640445272 |
Product | amidohydrolase |
Protein accession | YP_497785 |
Protein GI | 87200528 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
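The coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 2719619
END_BP = 2720899


def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS: codon count minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, protein_length(n_bp))  # 1281 426
```

Under these assumptions the result agrees with the listed gene length of 1281 bp and protein length of 426 aa.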
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCTTGT TCGCCCGTGT TCTCTTGATT GCCTCGAGCC TTTGCGCCGC GCCGCTGGCC GCGCAGAAGC AGGAGGCGCT GACGCTCCTG CGACCCGACG CGGTATTCGA CGGCGAGACC GCCGTCCTGC GCAAGGGTTG GGCCGTGCTG GTGCGGGGCA ACCGGATCGA GGCGGTGGGA CCGGATGTCG GCGCACCTGC CGAGGCCTCG GTGCTGGAAC TGCCGGGAAC GACCCTGATG CCCGGCATGA TCGAGGGGCA TTCGCACCTC TTTCTCCACC CCTACAACGA GACGCCGTGG GACGACCAGG TCCTGCACGA ACCGCTCGCG TTGAGGACTG CGCGGGCGAC GGTTCATGCG CGCGCGACGC TGATGGCAGG CTTTACCACC GTGCGTGATG TCGGCACCGA GGGCGCCGGC TATGCCGACG TGGGGCTGAA GCAGGCCATC GAGCAAGGGA TCGTGCCGGG GCCGCGCATG CTGGTGGCGA CGCGGGCCAT CGTGGCGCCC GGCGCCTACG GGCCGCGCGG GTTCGAGCCG GGCGTGGCAG TACCGCTGGG GGCCGAGGAA GCCGGCGGGC CGGACCTTGT CGACGCGGTG CGTCGGCAGA TCGGGGCGGG TGCGGATCTG GTGAAGGTCT ATGCCGACTA CCGCTGGGGA CCGGGCGAGC CGAGCCGCCC GACCTTTACC GAAGGCGAGC TGAAAGCGGC GGTAGAGGCT GCGCACAGCG CCGGGCGGCA GGTCGTCGCC CATGCCAGCA CGGCGGAAGG AATGCGCCGT GCCGTGGCAG CGGGCGTCGA CACCATCGAG CATGGGGACG AAGGTACGCC GGAGGTCTTC GCCGCGATGA AGGCCAGGGG CGTGGGCTTC TGCCCGACGC TGGCGGCCGG GGATGCGGTG GCGCGCTATC GCGGGTGGAA CGGTACAGCG CCCATGCCGA AGAGCGTGCA GGAAGGGTTC GATGCACTTG CAAAGGCGCG GAAGGCCGGG GTGGCGATTT GCATGGGCGG CGATGTTGGC GTCTATGCGC ACGGCGACAA TGCGCGCGAA GCGGAAATGA TGGTCAAGGG CGGAATGACG CCTGGCGAAG TGGCCATCGC CGCAACATCG GGCAATGCGC GCATGTTCGG CATCGGCGGC CGTCTGGGCG CGGTCAGGAC GGGTATGCTG GCTGACCTCG TGGCGGTCGA AGGCAATCCG CTCGCCGATA TTTCAGCGAT CCGGAAGGTG GCGCTGGTGA TGAAGGACGG CGTGCTGTGG AAAGGGCCTG TGGGGCGCTA G
Protein sequence | MRLFARVLLI ASSLCAAPLA AQKQEALTLL RPDAVFDGET AVLRKGWAVL VRGNRIEAVG PDVGAPAEAS VLELPGTTLM PGMIEGHSHL FLHPYNETPW DDQVLHEPLA LRTARATVHA RATLMAGFTT VRDVGTEGAG YADVGLKQAI EQGIVPGPRM LVATRAIVAP GAYGPRGFEP GVAVPLGAEE AGGPDLVDAV RRQIGAGADL VKVYADYRWG PGEPSRPTFT EGELKAAVEA AHSAGRQVVA HASTAEGMRR AVAAGVDTIE HGDEGTPEVF AAMKARGVGF CPTLAAGDAV ARYRGWNGTA PMPKSVQEGF DALAKARKAG VAICMGGDVG VYAHGDNARE AEMMVKGGMT PGEVAIAATS GNARMFGIGG RLGAVRTGML ADLVAVEGNP LADISAIRKV ALVMKDGVLW KGPVGR
| |
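The sequences above are printed in space-separated groups of 10. A minimal sketch of how one might strip the grouping, compute GC content, and translate the gene sequence for comparison with the protein sequence follows. It assumes the gene sequence is already shown in coding orientation (it begins with ATG and ends with TAG, although the gene lies on the minus strand). The codon-to-amino-acid mapping of translation table 11 is the same as the standard code; the table's alternative start codons are ignored here, which is a simplification that does not matter for a gene starting with ATG. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence; uppercase it."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in a cleaned sequence (0.0 for an empty one)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon.

    Unrecognized codons become 'X'; a trailing partial codon is ignored.
    """
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First two groups of the gene sequence above.
    prefix = clean("ATGCGCTTGT TCGCCCGTGT")
    print(translate(prefix))  # MRLFAR
```

Applied to the full gene sequence, `translate(clean(...))` can be compared with the cleaned protein sequence, and `gc_content` with the listed GC content, as a consistency check on the record.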