Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2093 |
Symbol | |
ID | 3917741 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2228948 |
End bp | 2230129 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640444846 |
Product | hypothetical protein |
Protein accession | YP_497366 |
Protein GI | 87200109 |
COG category | [S] Function unknown |
COG ID | [COG3825] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0341328 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCTTCA ACTTCGTCGA CGAACTGCGC GCCGCCGGCA TCCCGGCCAG CTTCAAGGAG CACCTCGTGC TTCTCGAGGC GCTGGAAAAG GACGTGATCG AGCAGACGCC CGAAGCGTTC TACTACCTCT CGCGCGCCAC CTTCGTGAAG GACGAGGGCC TGCTGGATCG CTTCGACCAG GTCTTCAACA AGGTCTTCAA GGGCCTGCTG ACCGACTATG GCCAGCGCCC CGTCGACATT CCCGAGGACT GGCTCAAGGC CGTCGCCGAG AAATTCCTCT CGGAAGAGGA AATGGAGAAG ATCAAGTCCC TCGGTTCGTG GGACGAGATC ATGGAGACGC TCAAGAAGCG GCTGGAAGAA CAGGAAAAGC GCCACCAGGG CGGCAACAAG TGGATCGGCA CCGGCGGCAC CAGCCCGTTC GGCAATTCGG GCTACAATCC CGAAGGCGTG CGCATCGGCG GCGAAAGCAG GCACAAGCGC GCGGTAAAAG TCTGGGAAAA GCGCGAATTC GCCAATCTCG ACAACACCCG GGAACTCGGC ACCCGCAACA TCAAGGTCGC CTTGCGTCGC CTGCGCCGCT TCGCCCGCGA AGGCGCGGCC GACGAACTCG ACCTCGAAGG CACGATCGAG GGCACCGCGC GCCAGGGCTG GCTCGATATC CGCATGCGCC CGGAAAAGCG CAATGCGGTC AAGCTCCTGC TGTTCCTCGA CGTCGGCGGA TCGATGGACC CGTTCATCAA GCTGGTCGAG GAACTGTTCA GCGCGGCCAC GGCCGAATTC AAGAACATGG AGTTCTTCTA CTTCCACAAC TGCCTCTACG AGGGCGTGTG GAAGGACAAC AAGCGCCGCT GGTCGGACCG CACCAGGACC TGGGACATCC TCCACAAGTT CGGCCACGAC TACAAGGTCG TGTTCGTCGG CGACGCGGCG ATGAGCCCCT ACGAGATCAG CCACCCCGGC GGCAGCGTCG AACACTTCAA CGAGGAAGCG GGCGCGGTGT GGATGCACCG CGTGGCGCAG ACCTACCCCG CCACCGTCTG GCTCAATCCC GTGCCCGAAA AGCAGTGGGC CTATTCGCAG TCGACCAAGA TGATCAAGGA CCTGATCGGC GGCAGCATGT ACCCGCTCAC CCTCGAGGGT CTCGACGGCG CCATGCGCGA GCTGACCCGC AAGAAGCACT GA
|
Protein sequence | MFFNFVDELR AAGIPASFKE HLVLLEALEK DVIEQTPEAF YYLSRATFVK DEGLLDRFDQ VFNKVFKGLL TDYGQRPVDI PEDWLKAVAE KFLSEEEMEK IKSLGSWDEI METLKKRLEE QEKRHQGGNK WIGTGGTSPF GNSGYNPEGV RIGGESRHKR AVKVWEKREF ANLDNTRELG TRNIKVALRR LRRFAREGAA DELDLEGTIE GTARQGWLDI RMRPEKRNAV KLLLFLDVGG SMDPFIKLVE ELFSAATAEF KNMEFFYFHN CLYEGVWKDN KRRWSDRTRT WDILHKFGHD YKVVFVGDAA MSPYEISHPG GSVEHFNEEA GAVWMHRVAQ TYPATVWLNP VPEKQWAYSQ STKMIKDLIG GSMYPLTLEG LDGAMRELTR KKH
|
| |