Gene Information |
Locus tag | Saro_3031 |
Symbol | |
ID | 3916643 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 3242442 |
End bp | 3243542 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640445811 |
Product | hypothetical protein |
Protein accession | YP_498300 |
Protein GI | 87201043 |
COG category | |
COG ID | |
TIGRFAM ID | |
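The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3242442, 3243542)
print(length, protein_length(length))  # 1101 366
```

Under these assumptions the computed values agree with the 1101 bp and 366 aa listed in the record.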
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00944915 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGC TCGGTCTCTC CATTGCCGCC CTGCTCGCCG GAACGACCGT CGCGAGCGCT TCCGCCCAGG CCTCCACGCT GTTCATGGGC TCCTATCCCG ACCGCATGCT GATCGTCGAC GAGGCATCGG GCAAGGTCAC CGACACGCTG ACGCTCGCCT CCGGCCTGCC GACCTCGATG CGGATCTCGA ACGACCGGAA GAAGATCTAC GTCACCACGA TCACGACCAG CGGGATTGAG GTGATCGACA CCGCCACGAA GAAGGTGGTC AACTCCTTCA GCCTGAACAC CCCGACCACG CGCTATCGCT TCAATGGCGG GGTGCCTGAT CCTTCGGGGC GCTATTTCTA CACGATGCTG ACGAAGTTCG AGAAGCTGAA CGACCGCTAC CTCGTCAGCC CTCAGCAGTT CGCAGTGATC GACCTTCAAA AGAAAGCCGT GGTGCGCACG TCCGAAGTGC CCAAGGAAGA TGACAGCAAC CCCAACGCCG GCTGGCGCAC CAACTACATG ATGTCCGAGG ACGGCAAGAC CTTGTTCGTG ATCCGCGACA AGGTGCTCGT GCTCGACACC GCCGACCTCA AGGTCAAGGA GCGGATCGAG GTTTCGCGCC CCGAGGCCAC CGGTATCGAG GGCGTGACCT TCGGCGGCGG GGTCGAAGCG CTGCGAAACC CGCACGAATA CGTCTCGCTG TTCAACGCGA CCGATCCCTA CATTCACAAC AAGATCTTCG GCGTCGGGCG CTTCAACCTG GCGACCAAGG CCTTCGACTT CCGCCCGATC GGCCCCGCGC CCTATGGCAT GGCCGGCCTG CAGGTCTCTC CCGACCTCAA GCAGGGCTGG ACGGTCGTCA CCAACGGCAG CGTGGGCAAC AAGCGGTGCG AATTCTGGCA TCTCGACCTC ACCACCAACC AGGTGAAGAA CAAGGCCGAA TTCCCCTGCC GTTCGCGCTT CCAGTTCGGC ATGTCGGGCG ACGGCACGAA GCTCTACATC TACGGCGCCA GCTACGACAT CGAGATCTAC GACGCACAGA CTCTGGCGCA CGAAAAGACG GTCGATCTCG GCGCGGACTC GACCGGCGCC GGGATGATAA TCACCCAGTG A
|
Protein sequence | MKKLGLSIAA LLAGTTVASA SAQASTLFMG SYPDRMLIVD EASGKVTDTL TLASGLPTSM RISNDRKKIY VTTITTSGIE VIDTATKKVV NSFSLNTPTT RYRFNGGVPD PSGRYFYTML TKFEKLNDRY LVSPQQFAVI DLQKKAVVRT SEVPKEDDSN PNAGWRTNYM MSEDGKTLFV IRDKVLVLDT ADLKVKERIE VSRPEATGIE GVTFGGGVEA LRNPHEYVSL FNATDPYIHN KIFGVGRFNL ATKAFDFRPI GPAPYGMAGL QVSPDLKQGW TVVTNGSVGN KRCEFWHLDL TTNQVKNKAE FPCRSRFQFG MSGDGTKLYI YGASYDIEIY DAQTLAHEKT VDLGADSTGA GMIITQ
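The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a field could be joined into a contiguous string and its GC fraction computed is given below; the helper names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def normalize(seq_field: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(seq_field.split()).upper()


def gc_fraction(seq_field: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence field."""
    seq = normalize(seq_field)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence shown above.
blocks = "ATGAAAAAGC TCGGTCTCTC"
print(normalize(blocks))    # ATGAAAAAGCTCGGTCTCTC
print(gc_fraction(blocks))  # 0.45
```

Applying `gc_fraction` to the full Gene sequence field would be one way to compare against the GC content listed in the record.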