Gene Information |
Locus tag | Saro_1862 |
Symbol | |
ID | 3917083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 1962769 |
End bp | 1963989 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640444606 |
Product | hypothetical protein |
Protein accession | YP_497136 |
Protein GI | 87199879 |
COG category | [S] Function unknown |
COG ID | [COG5441] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
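The coordinate, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. Neither assumption is stated in the record, although both are consistent with the listed values. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of residues, assuming the CDS includes one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1962769, 1963989)
print(length, protein_length_aa(length))  # 1221 406
```

The strand field does not enter this calculation: the length of a feature is the same whichever strand it lies on.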
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0341328 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGACA AGCCCAGCGT CCTGTTCATC TGCACGCAGG ATACCGAGGA AGAGGAAGCC CGCTTCACCC GCGCCGCGCT CGAGGCGGCG GGCGTCGAAG TCGTCCACCT CGATCCCAGT GTCCGCCGCT CGCTCGGCGG GGCGGAAATC TCGCCGGAAA TGGTCGCCCA GGCCGGCGGA ATGACCATCG AGGAAGTCCG CGCCCTCGGC CACGAAGGCA AGTGCCAGGA CGCGATGATC CGTGGTGCCA TCGCCGCCGC GCACGAATGG GACGCCAGAC ACCCCGTCTC CGGCATTCTC GCGGTCGGCG GCTCGATGGG CTCGGCGCTT GCCGGTGCGC TCATGCAGAG CTTCCCCTAT GGCCTGCCCA AGCTGATCGT CTCGACCATG GCCTCGGGCT TCACCAAGCC CTACATGGGC GTGAAGGACA TCGCGATGAT GAACGCGGTG ACCGATATCT CGGGCATCAA CACGATCAGC CGCGACGTCT TCCGCAACGC TGCCAACGCC GTTGCCGGAA TGGCGAAGGG CTACGACCGC GACAAGGGCC CCGAAAAGCC TCTCGTCCTC ATCACCACGC TCGGCACGAC GGAAACCAGC GTGAAACGCA TCCGCCAGGC ACTGGAAAGC GATGGCTGCG AAGTCATGGT CTTCCATTCC TCCGGCGCGG GCGGCCCCAC GCTCGACGGG CTCGCCGCCG ACAAGGACGT GGCGCTGGTC CTGGACCTTT CCCCGACCGA GATCCTCGAC CACCTCTTCG GCGGCCTGGC TGATGCCGGT CCGGATCGCG GGCGCGCGGC CCTGCGCAAG GGCATCCCGA CGATCCTTGC CCCCGGCAAT GCCGATTTCA TCATCGGCGG TCCGATCGAC GCCGCGGAAG CGCAGTTTCC AGGCCGGCGC TACCACCAGC ACAACCCGCA GCTCACCGCA GTCCGCACCA ACGTCGCGGA CCTTCGGAAG CTGGCCGATC ACCTTGCCGC CAACGTGCGC GAGGCCAAGG GCCCGGTCCG GGTCTTCACC CCGCTCAAGG GCTTTTCCAG CCACGACAGC GAAACGGGCC ACCTGCTCGA CCTCTCGGTG CCGGGACCCT TCGCCGAATA TCTCGCCAGC GTCATGCCAG GTCACGTGCC GGTGACCGCC GTGGACGCCC ATTTCAACGA CGAAGCCTTC TCCAGCGCGG TCATTGCCGC CGCGCGCGAG ATGCTTGCCG CAAAGAACTG A
|
Protein sequence | MTDKPSVLFI CTQDTEEEEA RFTRAALEAA GVEVVHLDPS VRRSLGGAEI SPEMVAQAGG MTIEEVRALG HEGKCQDAMI RGAIAAAHEW DARHPVSGIL AVGGSMGSAL AGALMQSFPY GLPKLIVSTM ASGFTKPYMG VKDIAMMNAV TDISGINTIS RDVFRNAANA VAGMAKGYDR DKGPEKPLVL ITTLGTTETS VKRIRQALES DGCEVMVFHS SGAGGPTLDG LAADKDVALV LDLSPTEILD HLFGGLADAG PDRGRAALRK GIPTILAPGN ADFIIGGPID AAEAQFPGRR YHQHNPQLTA VRTNVADLRK LADHLAANVR EAKGPVRVFT PLKGFSSHDS ETGHLLDLSV PGPFAEYLAS VMPGHVPVTA VDAHFNDEAF SSAVIAAARE MLAAKN
|
| |
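The gene sequence is listed in coding orientation (it begins with ATG and ends with TGA), so it can be translated directly even though the gene lies on the minus strand of the replicon. The sketch below shows one way to strip the 10-base grouping, compute a GC fraction, and translate codons. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons). The helper names are hypothetical, and only the first 30 bases of the record are used as a worked example.

```python
from itertools import product

# Codon table in NCBI order (T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence in this record.
prefix = "ATGACCGACA AGCCCAGCGT CCTGTTCATC"
print(translate(prefix))          # MTDKPSVLFI
print(gc_fraction("ATGACCGACA"))  # 0.5
```

The translated prefix matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.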