Gene Information |
Locus tag | Saro_2202 |
Symbol | |
ID | 3918868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2342460 |
End bp | 2343707 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640444957 |
Product | hypothetical protein |
Protein accession | YP_497474 |
Protein GI | 87200217 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | |
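The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the reported values are consistent with that), and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Values taken from the record: Start bp, End bp.
    length = gene_length_bp(2342460, 2343707)
    print(length, protein_length_aa(length))
```

With the values in this record, the computed lengths agree with the listed Gene Length (1248 bp) and Protein Length (415 aa).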
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000000618682 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGCTCAAGG GGGCGATCTT TTCGCTGCAC GATGTGCTCG TGACCAAGGG CACGATCAAC GCGCCGCTGT TTGAGGAAAC GCTGCGGCTG CTGCGCTATC TCAAGGCGCG GGGTGTCGAA CCGGTTTTCA TTGGCAACCA TGACTGGACG GTCACCAGTC CCGGCCAGTC GAAGCCGTTC CGGACCCTGC TCGAAGAGCG GCTCGGTCCG ATCAGCTATT ATATCGGCGG CCAAAACGGG ATGCCTTATA AGCCACGCGC CGATTCCACC GCGCATATCC TCTCCGACAA GGGCTGGCAG CGGAATGAGG TCCTTTATGT CGGCAACACA ACCGACGATA TGAAGACCGC TGCCAATGGC GGCCTGATGT TTGTGAACGT CATGTGGCAC GGAGTGGCGA GCCCCTACGG CTTTCAATTC GACTCTCCAC GCGACGTCGC GCGCTTCGTC GATTGCCTCT GTCTGGGCCT CGACGGCTGG TTCTGGGCGC TCGAACAGAG CGATCTGCGG GTCTATGCGC TCGCGCCTTT CACAACGCTC TCGCCGCGCT ACGCACAAGC GCATGCCTAT TCTGAAAACG CCAAGGCGAC CTCGAAACAC GGTGCCGGTG ATGCGAATTT CTGGGGCCGT CTGCTCGCGG CGCGCATCTA TTTTTCAGGT CTCGCTGACG AGATCGACTA TATCACCGCC TATCCCGGGC ACGCGCCTAC TTCCAACGCG ACGGTGATCA GTGAGGCGCT TAACATCCTG GGGCAGTCAC TGCGCAAGAG CTATCTGCCC GACCTCATTC TCCGTCACAC CAAAGCGGTG AAATCGCAGA CCGCGCGGGC CTCAGGGGGA AGCGTGGGCC TCGACAATCA GCTCAACACG ATCCGGCTCA ACCCGGCACC CGTCCGCGGC GTGGGCGGCA AACCCTATAA GTCGCCGCCC GCGCGCGGCG GCAAGCGTGT CCTCGTTATC GATGATATCT GCACCGAGGG TAACAGCTTC GAGGGTGCGC GGGCCTATCT GAGGGCCGCA GGAGCGCAAA CGGTCTGCGT GAGCTGGCTC AAGACGATCA ATAAGGACTA TCGCGCCGTG TCACCAGCCT TCGGCCCGTT CAATCCCTAC ATCGCGCAAA CCTTCCCGAC ACCGATCGCG ACCACAACGC ACTGGTATTC GAGCGCGATC AGCTCGCATG CTGCGCCGAC TGACCTCGCC GACGTCTATA ATCGCTACTT CAGCTGGGCT TGGCCCGCCG ATATATGA
Protein sequence | MLKGAIFSLH DVLVTKGTIN APLFEETLRL LRYLKARGVE PVFIGNHDWT VTSPGQSKPF RTLLEERLGP ISYYIGGQNG MPYKPRADST AHILSDKGWQ RNEVLYVGNT TDDMKTAANG GLMFVNVMWH GVASPYGFQF DSPRDVARFV DCLCLGLDGW FWALEQSDLR VYALAPFTTL SPRYAQAHAY SENAKATSKH GAGDANFWGR LLAARIYFSG LADEIDYITA YPGHAPTSNA TVISEALNIL GQSLRKSYLP DLILRHTKAV KSQTARASGG SVGLDNQLNT IRLNPAPVRG VGGKPYKSPP ARGGKRVLVI DDICTEGNSF EGARAYLRAA GAQTVCVSWL KTINKDYRAV SPAFGPFNPY IAQTFPTPIA TTTHWYSSAI SSHAAPTDLA DVYNRYFSWA WPADI
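The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own pipeline: it uses the standard codon assignments (which translation table 11 shares for elongation codons), and it simplifies start-codon handling by always reading the first codon as Met, since table 11 permits alternative initiators such as GTG. The name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a complete CDS; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("CDS must be non-empty and a multiple of 3 in length")
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    # Simplifying assumption: the first codon is an initiator and is read
    # as Met, without checking that it is a valid table 11 start codon.
    protein[0] = "M"
    # Drop a terminal stop codon if one is present.
    if protein[-1] == "*":
        protein.pop()
    return "".join(protein)


if __name__ == "__main__":
    # First 18 bases of the gene sequence in this record.
    print(translate_cds("GTGCTCAAGG GGGCGATC"))
```

Applied to the first six codons of the gene sequence, this yields `MLKGAI`, matching the start of the listed protein sequence even though the first codon is GTG rather than ATG.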