Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3958 |
Symbol | |
ID | 5077442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009426 |
Strand | - |
Start bp | 130001 |
End bp | 131227 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640481064 |
Product | hypothetical protein |
Protein accession | YP_001165726 |
Protein GI | 146275565 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCCAGC GATACCGTGT GCTCGGCCGC AGCGTGGCCG AAGGCGATGA ACTTCAGCCG CTTTTGGCAA AAGCCTACGC GCAAAAGGCC CAGGTCCTGT GCGAATGCCG CAAGGGGACC GACCTGCCGC TCTACATTTC GCACAGGAAC GACCGCTACG TGCTGGCGCG CTGGCCGGGC TCTGGCGCGC GGCACGCGAC CGCCTGCGAC CACTACGAGG CGCCCGATTA CCTGACCGGC ATGGGCCAGG TCCGCGGCGC GGCGATCATC GACGATGAGA CGAGCGGCGA AACCAGCCTC AAGCTCGGCT TCCCGCTCTC ACGTGGCAGC GCCCGGCTCG CGCCCGCGGC GATAACCAAC GACAAGCCGA CGGTGAAATC GTCGGGCCAG AAGCTGTCGA TGCGCGGGCT GCTCCACGTC CTATGGGACC GCGCCGAGCT CACGCACTGG CATCCCAAGA TGGCAGGCAA GCGCAGCTGG TTCGTGGTGC GCCGCGCGCT GCTCGAGGCA GCCGCGTCAT GCCGGGCGAA GCAGGAGGCC CTTCCCCATG TGCTGTTCGT GCCGGAAAGC TTCAAGCTGG AGGAGAAGGA GGACATCCGC GCCCGCCGCC GCACCGCGCT CGAGCGGGTC TACCGCTCGC GCGACGAGAT GATGGTGGTG GTCGGCGAGA TCAAGGAGAT CGTCTCGGCC CACGGCGCGG AGCGGATCGT CCTGCGCCAC GTCGGCGACA TGCCGTTCGT GATGGACACC GATATGGCGC GCCGGTTCCA CAAGCGGTTC GCGGGCGAAC TCGCGCTGTG GCAGGCGCAG CATGGCAGCA AAGCCGAGCA GGATCACCTG GTGATCGCGG GCTCGTTCGC GCGGCGGCGC GAAGGCACCT TCGACCTGAT CGAGGTCGCA CTGATGCCGG TGACGCCCGA ATGGCTCCCC TACGAGACCA GCGACGAGCG CTATCTGATC GCCAAGGCGG TCGCCGAGAA GCGCCGGTTC GTGAAGGGCC TGCGCGTCAA TCTCGACGTC GACATGCCGA TCGCGAGCCT GGTGCTGAAG GACACTGGCG AGGAAGCCTG CGCGGTGCAT ATCCATGACC GCGACAACGA AGTGGCCGAG CCGCTCGAGG CGCTGCTCGC CGGGCAAGGC GTCGCGCACC GGTTCTGGAA GGAAGGCGAG CCGCTCCCGG CGCGCGTCAC GCGGCAACGA CGCTGGGAAG CGCAGGCCGC AGCCTGA
|
Protein sequence | MIQRYRVLGR SVAEGDELQP LLAKAYAQKA QVLCECRKGT DLPLYISHRN DRYVLARWPG SGARHATACD HYEAPDYLTG MGQVRGAAII DDETSGETSL KLGFPLSRGS ARLAPAAITN DKPTVKSSGQ KLSMRGLLHV LWDRAELTHW HPKMAGKRSW FVVRRALLEA AASCRAKQEA LPHVLFVPES FKLEEKEDIR ARRRTALERV YRSRDEMMVV VGEIKEIVSA HGAERIVLRH VGDMPFVMDT DMARRFHKRF AGELALWQAQ HGSKAEQDHL VIAGSFARRR EGTFDLIEVA LMPVTPEWLP YETSDERYLI AKAVAEKRRF VKGLRVNLDV DMPIASLVLK DTGEEACAVH IHDRDNEVAE PLEALLAGQG VAHRFWKEGE PLPARVTRQR RWEAQAAA
|
| |