Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3292 |
Symbol | |
ID | 3915939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 3510824 |
End bp | 3512479 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640446077 |
Product | hypothetical protein |
Protein accession | YP_498561 |
Protein GI | 87201304 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.648206 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGGTT TGACGCAAGT CCGCGAAAAG GATTGGCTGC AGGGCCTCAA GGCGATCTGG GCGCGCGCCG GTGCGCTGCC GTGGCTCTCC GGGCCGGGCA TCCTGCTGTG CGTTCTGATC GTGTGCCTGC CTTTCGCGCT GGTCGACGTG CCGCCGCTGA TCGACGTGCC GGGCCACATG GGCGCCGCCG CGATCGAGGC TGCCGGTCCG GGCAATCCGC TGGAAAAGTA CTTCACCTGG AAATGGGTGT TCACGCTGAA CATCGGCGGC GGCGTGCTGA TGAAGGTGCT GGGCGCGCAG TTCGGCATAC TTGCTGCGGG GTGGTGGAGC ACGGTCCTGG CCACCGGCCT GTTCGCGGGC GGCTGCCTGT GGACGATCCG CATGCTCAAT CCCCGGGGCG GGCACGCGGC GGCATGGTCG CTGATGTTCG TGTTCAGCTT TCCCCTGCTC ACCGGCTTCC TCAATTACAT CCTCGCTACC GGCCTCTCGC TGACCGCGTT CGGCGCGTCG CTGTGGCTGG AACAGAAGAA CCCGCGCGCC CGCGCGGCGA TGCTGATCGT GGCGCAGCCG GTCGCAATGC TGTGCCACGC CATCGGCGGC CTTCTGCTCG CGCTTCTCGT CGCTGCCCAC GCCTTTGGCC GGGCGATGGA CGAACTGCCC GAAGGCTGGC GCTGGCGCGA TCTCACCAAC CGCCAGTGGC TGAGAACGCT CGACTGGAAG GCGATCGGCC TGCGTCTGTG GCAGGCCTGC TGGCCCTTGC TCGCCACGGT TGTCACCATC CTGTTGTGGA AGGCATTTTC GCCGCCCGCC AAGAGCATGA ACGTCTGGCG CTGGGACCAG AAGGCCTGGT CCTTCGTGCT GACCCTGCGC GACCAGTCCA AGCTGCTTGA TTTCGGCACG TCGATCATCG CCGGCCTGCT GGTTCTGGTT GGCCCGTTCC TCGGCGCGAA ATGGCGCTGG CGCCAAGGCC TGCCCGCGCT GACCGTCTTG CTGCTGTTCA TCGCGATCCC AAGCGATATC AACGGCTCCG CTTTCGTCGA CATCCGCCTT CTGCCGGTTG CGGCCATGCT CGGCCTCGGT CTCCAGGACT GGAGCGGGGC GCGCCGTCCG CAATGGGCTA AGATCGTGGC CTGTCTTGGC ATGGCGCTTC TTGCCGTACG CCTGACGGTC ACCGCGTGGA GCTTTGCCGG CTATGCCGAG GACTACAACA AGCAGCTTTC CGCGCTGACC CATGTCGAAC CGGGCAGCCG CGTCCTCGCC TTTGTCGAGC ATTCCTGCCT CGACGAATCG TGGCGCAACA CCCGGCGCGA TCACCTGGCG AGCCTCGCCA GCCTCTATCG CCAGGCGTGG GTGAACGACA ACTGGGCCGT TCCGGGCCTG CACATGGTCA TTCCGCGCTT CCGCCCCGGC CGCAACTTCA CCGCCGATCC TTCCGAATTC GTCTGGTCGC GCCGCTGCGC CGGCGGCTGG CGCCGTACGG TCGAGACAGC GCTCAAGCAT GCCCCGATCG AGCGGGTGGA CTATGTCTGG CTGATCGATA CCGGCCTGCC GCGCCGCGCC GACCCGCGCC TCCAACTCGT GTGGCAGGAA GGCCGCAGCC TGCTGTTCAA GGTGCGTCCG CTTGGCATCC CGACCTGGAA GCCGAAGGAT TTCTGA
|
Protein sequence | MTGLTQVREK DWLQGLKAIW ARAGALPWLS GPGILLCVLI VCLPFALVDV PPLIDVPGHM GAAAIEAAGP GNPLEKYFTW KWVFTLNIGG GVLMKVLGAQ FGILAAGWWS TVLATGLFAG GCLWTIRMLN PRGGHAAAWS LMFVFSFPLL TGFLNYILAT GLSLTAFGAS LWLEQKNPRA RAAMLIVAQP VAMLCHAIGG LLLALLVAAH AFGRAMDELP EGWRWRDLTN RQWLRTLDWK AIGLRLWQAC WPLLATVVTI LLWKAFSPPA KSMNVWRWDQ KAWSFVLTLR DQSKLLDFGT SIIAGLLVLV GPFLGAKWRW RQGLPALTVL LLFIAIPSDI NGSAFVDIRL LPVAAMLGLG LQDWSGARRP QWAKIVACLG MALLAVRLTV TAWSFAGYAE DYNKQLSALT HVEPGSRVLA FVEHSCLDES WRNTRRDHLA SLASLYRQAW VNDNWAVPGL HMVIPRFRPG RNFTADPSEF VWSRRCAGGW RRTVETALKH APIERVDYVW LIDTGLPRRA DPRLQLVWQE GRSLLFKVRP LGIPTWKPKD F
|
| |