Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3531 |
Symbol | |
ID | 5077680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 148235 |
End bp | 149311 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640481255 |
Product | hypothetical protein |
Protein accession | YP_001165917 |
Protein GI | 146275757 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCACAC CCGACGATCT CATCGCCCGC CACGGCATCC TCAACGCCCT GGCCAACCAC AGCCGCGGGG TGGACCGCGC CGATGGCAAC CTGCTCGGCT CTGCCTATCA TCCCGGCGCC ACGGTCGATT ACGGCTTCTT CGCGGGCCCG GCCGCAACCC TGGTCGACAT TCTGGCAGGC GCGCAGAAAA CTGCCCTGCC CACGCTCCAC CGCACGTCCA ATTGCTGGAT CCGGGTCGAT GGCCGCCGCG CCATCTCGGA ATCAAGCGTG ATCGCCTATG TCGAGGAGGC GGACCTGCAA CGCTGGGTCT TCGGCCGCTA TCTCGACCGG CACGAATGCC GGACGAACGA AGCCGGCGAG GACGAATGGC GCCTCGTCCA CCGCACCTAC GTGCTCGACG GCAACGTCAA CCGGCCATCC ACCGCCGCGC GCTCCGATCC GCCCGTCGGC CTTGCCCATT TCGTGCCCGC CGGCGGCAAG GGCGCCGCCG ATCCGGGCCG CGCGCTGCTT GCCTTCCACG CCGCCTGCGC CCGCCCCAAC GGTTCGAGGA ACACTGCCAT GTCCGCACCT GAAACCCGCG AAGCCGCGCT CGACGCCGCG CTTGCCCGCG CCGAGATCCA CGACCTGTGC ATGGCCTATG CGCGCGGCGT GGACCGCGCC GACGCCGATC TTCTCGCCTC GATCTTCACC GACGATTCCA CCGTGATCTC GGGTGTCGTG AACGGCTCGG GCAAGGATTT CGCGCGCGAT ATCACCGCCT TCGTGCGCGA CAACCTCGAG ATGTGCTTCC ACTCGGTCGC CAACGAATGG ATCGAGGTTC GCGGCGACGA GGCCGTGGGC GAACATTATG CCCTGGCCCA GATGGTCCAG GCAGGCACCG AAATCCTGAC CGGCGGCCGC TATATCGACC GCTACGTCCG GCGCGACGGC AAGTGGCTGA TCCTTAGCCG CACTTTCGTC GCCGACTGGA CCCATTCACA CCCCTCGACG ATGGAACGCG ACGGCTTCTA CGAGGCGCTC AGCATCCGCG GCTGCTTCGG CCACGAAGAC CCGATCTACG CCCACTGGGC GGCATAA
|
Protein sequence | MVTPDDLIAR HGILNALANH SRGVDRADGN LLGSAYHPGA TVDYGFFAGP AATLVDILAG AQKTALPTLH RTSNCWIRVD GRRAISESSV IAYVEEADLQ RWVFGRYLDR HECRTNEAGE DEWRLVHRTY VLDGNVNRPS TAARSDPPVG LAHFVPAGGK GAADPGRALL AFHAACARPN GSRNTAMSAP ETREAALDAA LARAEIHDLC MAYARGVDRA DADLLASIFT DDSTVISGVV NGSGKDFARD ITAFVRDNLE MCFHSVANEW IEVRGDEAVG EHYALAQMVQ AGTEILTGGR YIDRYVRRDG KWLILSRTFV ADWTHSHPST MERDGFYEAL SIRGCFGHED PIYAHWAA
|
| |