Gene Information |
Locus tag | Saro_0022 |
Symbol | |
ID | 3916064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 17734 |
End bp | 19323 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640442747 |
Product | hypothetical protein |
Protein accession | YP_495305 |
Protein GI | 87198048 |
COG category | |
COG ID | |
TIGRFAM ID | |
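The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the final codon of the gene is a stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(17734, 19323)
print(length, protein_length_aa(length))  # 1590 529
```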
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.575522 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGTCGAA TTTTATTAAG AACAGCCAGC GCCTTGGCGG CGCTTTGCCT CGCCATGCCA GTTCAGGCAA AATGGTACGA GGCATCAAGC AAGCACTTCG TGGTCTATTC CGACCGCGGG ACTGAACAAG TCCGGATAAT GGCCTCCAAG CTCGAGCGCT TTGACGCAGC GCTGCGTCAG CTTACCGGTC GGACTGACGA ACCTATCGCC CCGTCGAACC GGGTCACCAT TTTCATGGTG CAGTCGCCGA TCGATGTCGC CCGCCTGTTC GGAAAGGGCG GAAACTACGT CGGTGGCTTT TACTCCTCGA TCGCAGGCGC GTCATACGCG ATCGTCCCGC CATTCATTGT GCAGGAGACC GGTGAGAACG GTTTCGCCGA GGGCGTATTG TTGCACGAGT ACGCGCACCA TTTCGTTTCG GAAAACCAGT CGATCCTCTA CCCGATATGG CTGAACGAGG GATTCGCCGA GTTCGTCTCA ACCGCACGCT TCGAGAAGAA CGGTTCGATC GGCCTCGGGC TGCCAGGGCA GTGGCGCGGC TGGCAATTGA CGCACCAGGT CTCCGTGCCC GCCGAAATGC TGGTCGACAG CCAGGCCTAT TTCGCGCGGC GCGAGTTGGC GTTCGACCAG TTCTACCCCC GCGCGTGGCT GCTTTATCAC ATGCTGACGT TCGAGGCCTC GCGGGAAGGG CAACTCGCCG CATATGCAGG TGCGCTCAGC CGCGGGCTGA ACGACCGGGA TGCCGCCATC GCGGCATTCG GCGATTTGCG TCAGCTCGAA CTAGATCTTC AATCGTACGG CCGGAAGGTC ACGATGCCCT ACTACCTCAT AAAGGGGGAA ACCTTGCGCC CCGCTCAGGT GAACGTACGG GAGCTGAGCA AGGAAGATGC CGAAGCGATG CCCTTCTGGA TGCGCATTCG CCGAGGCTTG GAAGAAGGGA ACGCCGAACC GCTGGCGGTT GAAGTTCGGG CAATGGCCGC GCGCAACCCC ACGAGTGCCT TCGTGCAGAC CATCCTCGGG ATGGCGGAGT TCGAAGCCGA TCATGTCGAT GCTGCACTGG CCGCTGCCGA CACGGCAATC GCAATCGACC CGATGGCGAT CGAGGCTCAC GTCCTCAAAG GGCGCGCGAT GCTGGCCAAA GCGTCCCGGA ACGGCGCGAC GCCCGTCGAA TGGAACGCTG TGCGCGCGGC ATTTCTCAAG GCGAACGCGA TCGACCCTGA TCACCCGCAA CCTCTGTTCC TGTACTACGC ATCGTTCTTG AGTCAGGGGT TCAAGCCCAC CGGTAACGCC TTGCAGGCTC TCTCCCGCGC GCTGGAGCTC GCGCCATACG ACCGTGTAAT CCGCGCGGAA CTGGCGGAGA GTCAAGTTTC TCGCGCCCAA TTCGAGGAGG CGAAGTCGAC GATACGCATG CTCATGCGTG ATCCGCATTC TCCACTGGGC GGCGCCCGCA TCCGCGCGGT CATGGAAAAG CTCGACGAGC GCAACGCCGA AGGTGCTCGC AAACTTCTCG AGATGAGCAA CGAGGAGTTC AAGGCAAAGC AGGAGAAGGG TGACCCAGAC AGTGGCGCCG GTCGCGAGGC GGCTGCCTGA
|
Protein sequence | MRRILLRTAS ALAALCLAMP VQAKWYEASS KHFVVYSDRG TEQVRIMASK LERFDAALRQ LTGRTDEPIA PSNRVTIFMV QSPIDVARLF GKGGNYVGGF YSSIAGASYA IVPPFIVQET GENGFAEGVL LHEYAHHFVS ENQSILYPIW LNEGFAEFVS TARFEKNGSI GLGLPGQWRG WQLTHQVSVP AEMLVDSQAY FARRELAFDQ FYPRAWLLYH MLTFEASREG QLAAYAGALS RGLNDRDAAI AAFGDLRQLE LDLQSYGRKV TMPYYLIKGE TLRPAQVNVR ELSKEDAEAM PFWMRIRRGL EEGNAEPLAV EVRAMAARNP TSAFVQTILG MAEFEADHVD AALAAADTAI AIDPMAIEAH VLKGRAMLAK ASRNGATPVE WNAVRAAFLK ANAIDPDHPQ PLFLYYASFL SQGFKPTGNA LQALSRALEL APYDRVIRAE LAESQVSRAQ FEEAKSTIRM LMRDPHSPLG GARIRAVMEK LDERNAEGAR KLLEMSNEEF KAKQEKGDPD SGAGREAAA
|
| |
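The protein sequence can be reproduced from the gene sequence using translation table 11, where the GTG start codon is read as methionine. The following is a minimal sketch in plain Python, not the database's own pipeline: the helper names are hypothetical, the codon assignments are those of the standard genetic code (which table 11 shares), and the start-codon set is the one NCBI lists for table 11. It also strips the ten-base grouping used in the record and includes a small GC-content helper for comparison with the GC content field.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Codon -> amino acid map built in the conventional TCAG ordering.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Start codons permitted by translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the whitespace that groups the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a complete CDS; the first codon is always read as M."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS_TABLE_11:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]
    for codon in codons[1:]:
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCGTCGAA TTTTATTAAG AACAGCCAGC"))  # MRRILLRTAS
```

Passing the full gene sequence to `translate_cds` should yield a string that can be compared against the protein sequence field once its spaces are removed with `clean`.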