Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3875 |
Symbol | |
ID | 5077486 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009426 |
Strand | + |
Start bp | 44083 |
End bp | 45630 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640480984 |
Product | FAD linked oxidase domain-containing protein |
Protein accession | YP_001165646 |
Protein GI | 146275485 |
COG category | [C] Energy production and conversion |
COG ID | [COG0277] FAD/FMN-containing dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.266937 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCACAAG GCGTCGACGC CGCAACCTTT ACCAAGGCTC TGGACGAATT AGCGGCTATC GTCGGCAAGG AATGGGTCTT CGTCGATGAA CTGCCGCTGT CGGCCTATCG CGACGCCTAT TCACCGCTGG CCGATGGCGA GATGCTGCCC TCGGCTGCTG TGGCACCGGC CAACATGGAA CAGATTCAGC AGGCGCTGAA GGTATTCAAC GCTTACAAGC TGCCGATCTG GACATTCGGC AACGGCCGCA ATTTCGCCTA TGGCGGCCCG GCCCCGCGCC AGTCCGGTTA TGTCATGTTC GACCTCAAGC GGATGAACCG CATTCTCGAA GTCAACGAGA AATACGGCTA TGCGCTGGTC GAACCGGGCG TTTCCTATTT TCAGCTTCAC CGCCATCTGC GCAAGATCGG CAGCAAGCTT TGGGTCGATC CTGCGGCACC AGGTTGGGGC GGCGTCATGG GCAACGCGCT TGAACATGGC GCGGGTTACA CCCCCTACGG CGATCACTTC GTTATGCAGT GCGGCATGGA AGTGGTCCTT GCCGACGGTC AGGTCGTCCG CACCGGTCAG GGCGCGATCG AGGGGTCGCA GCATTGGCAA TCCACCAAGC ATGCAGCCGG CCCGCATTTT GATGGCATGT TCACTCAGTC CAATTTCGGC ATTGTTACCA AGATGGGTAT CTGGTTGATG CCCGAACCGC CGGGCTACAA GCCGTTCATG ATCACTTACG AGCGCGAGGA AGACCTTGCC GCAATCTTCG ACGCGGTCAA ACCGCTCAAG GTTAACCAGG TGATTCCCAA CGCCGCGGTG GCGGTCGATC TCTTGTGGGA AGTGTCCGCC AAGACCACGC GCCGCCATTA CTTCGATGGC AAGGGCCCTA TCCCGGATTC GATCCGCAAG AAGATCGCGT CGGATCATGG CCTGGGCATG TGGAACTTCT ACGCCGCGCT CTATGGTCCG CCGCCGATCA TCGAGAACAA CTGGAAGCTC GTCGAGGAAG CGATGATGAG CATTTCGGGT GCCAAGCTGC ACCTCAACCG CGAAAACGAT CCCGCCTGGG ATTATCGCGT GCGGCTGATG CGCGGCGAGC CGAACATGAC CGAATTCAGC ATCATGAACT GGATCGGCGG CGGCGGGCAT ATCAACTTCT CGCCAATCTC GGCACCCGAC GGCAAGGAAG CGCTGAGCCA GTATAACCTG ATCAAACAGC GCTGCCACGA TTTCGGGTTC GACTATATCG GCGAGTTCCT GGTCGGCTGG CGCGACATGC ACCACATCCT GATGATCATG TACGATCGCG CCGACGACGG CATGCGCAAG AGCGCCTATG ACCTGTTCGG CAAGCTGGTC GACGAGGCAG CCGGTGCGGG CTTCGGCGAA TACCGCACCC ACCTCGCCTT CATGGACCAG ATTGCCAAGA CCTATAAGCA CAACGATGGG GCGCTGTGGG ACCTGCACCA CCGTCTCAAG GACGTGCTCG ACCCCAACGG CATCCTCTCC CCCGGCAAGC AGGGGATATG GCCCCAGGCG ATGCGCAACC AAGCGTAA
|
Protein sequence | MPQGVDAATF TKALDELAAI VGKEWVFVDE LPLSAYRDAY SPLADGEMLP SAAVAPANME QIQQALKVFN AYKLPIWTFG NGRNFAYGGP APRQSGYVMF DLKRMNRILE VNEKYGYALV EPGVSYFQLH RHLRKIGSKL WVDPAAPGWG GVMGNALEHG AGYTPYGDHF VMQCGMEVVL ADGQVVRTGQ GAIEGSQHWQ STKHAAGPHF DGMFTQSNFG IVTKMGIWLM PEPPGYKPFM ITYEREEDLA AIFDAVKPLK VNQVIPNAAV AVDLLWEVSA KTTRRHYFDG KGPIPDSIRK KIASDHGLGM WNFYAALYGP PPIIENNWKL VEEAMMSISG AKLHLNREND PAWDYRVRLM RGEPNMTEFS IMNWIGGGGH INFSPISAPD GKEALSQYNL IKQRCHDFGF DYIGEFLVGW RDMHHILMIM YDRADDGMRK SAYDLFGKLV DEAAGAGFGE YRTHLAFMDQ IAKTYKHNDG ALWDLHHRLK DVLDPNGILS PGKQGIWPQA MRNQA
|
| |