Gene Information |
Locus tag | Saro_2098 |
Symbol | |
ID | 3917746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2234649 |
End bp | 2235851 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640444851 |
Product | hypothetical protein |
Protein accession | YP_497371 |
Protein GI | 87200114 |
COG category | [S] Function unknown |
COG ID | [COG3876] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0323698 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTTTG GCATCGACCG GCTGCTCGCC GACCCCGCTC TCAGGAAACC GCTCGAAGGC CGCCGTATCG CGCTCGTCGC GCATCCGGCC TCGGTGACCG AGGGCCTCGT CCATTCGCTC GACGCGCTGG CCGCGCTGCC CGAGGTTCGC CTTGCCGCCG CCTTCGGGCC GCAGCACGGG CTGAAGGGCG ACAAGCAGGA CAACATGGTC GAGACCGCCG ACGAGCTGGA CCCGACCTAC GGCATCCCTG TGTTCAGCCT CTACGGCGAG GTCCGCCGCC CGACCGCGCG GATGATGGAC AGCGCCGACG TGTTCCTGTT CGACCTCCAG GACCTTGGCT GCCGGATCTA CACTTTCGTG ACGACGCTGC TTTATCTGCT CGAAGCAGCG AGCGGGACGG GCAAGGCGGT CTGGGTGCTC GATCGCCCGA ATCCCGCCGG CCGTCCGGTG GAAGGCACGA CGCTCCTGCC GGGCTGGGAA AGCTTCGTCG GGGCGGGGCC GATGCCGATG CGCCACGGGA TGACCCTGGG CGAAATGGGC GCCTGGTTCG TCGAGCACTT CAAGCTCGAC GTCGATTACC GCGTGATCGC GATGGAAGGC TGGACTCCCG GCGAGGGTCC GGGCTGGGGC TGGCCGGAGA GCCGCATCTG GGTGAACCCT TCGCCCAATG CCGCGAGCCT CAACATGGCG CGGGCCTATG CCGGCACGGT CATGATCGAG GGCGCGACGC TTTCGGAAGG GCGCGGCACC ACGCGCCCGC TCGAGGTGCT GTTCGGCGCG CCCGACGTGG ACGCCAGGGC GGTGCTGGCC GAAATGCGCG GCTTCGCGCC GCAGTGGATG CAAGGCTGCG CGATCCGCGA GTGCTGGTTC GAGCCGACCT TCCACAAGCA CGCGAAGAGC CTGTGCAGCG CGCTGATGAT CCACGCCGAG GGCGCGTTCT ACGATCACCA CGCGTTCCGC CCGTGGCGCT TGCAGGCGCT GGCGTTCAAG GCGATCCGGC GGCTCTGGCC GGACTACCCG ATCTGGCGCG ATTTCCCCTA CGAGTACGTG TTCGACAAGC TGGCGATCGA CGTGATCAAC GGCGGTCCCG CGCTGCGCGA GTGGGTGGAC GACATGGGCA GCGAGGCGGG CGATCTCGAT TCGATGGCCG GGGCGGACGA AGCGGCCTGG ATCGAGGAGC GGCAGCGCTT CCTGCTCTAC TGA
|
Protein sequence | MKFGIDRLLA DPALRKPLEG RRIALVAHPA SVTEGLVHSL DALAALPEVR LAAAFGPQHG LKGDKQDNMV ETADELDPTY GIPVFSLYGE VRRPTARMMD SADVFLFDLQ DLGCRIYTFV TTLLYLLEAA SGTGKAVWVL DRPNPAGRPV EGTTLLPGWE SFVGAGPMPM RHGMTLGEMG AWFVEHFKLD VDYRVIAMEG WTPGEGPGWG WPESRIWVNP SPNAASLNMA RAYAGTVMIE GATLSEGRGT TRPLEVLFGA PDVDARAVLA EMRGFAPQWM QGCAIRECWF EPTFHKHAKS LCSALMIHAE GAFYDHHAFR PWRLQALAFK AIRRLWPDYP IWRDFPYEYV FDKLAIDVIN GGPALREWVD DMGSEAGDLD SMAGADEAAW IEERQRFLLY
|
| |
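The length fields in this record can be cross-checked against the coordinates. The sketch below is a minimal illustration, not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the CDS length includes a single terminal stop codon (the gene sequence above ends in TGA). The GC helper ignores the spaces used to group the sequence into blocks of ten.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


if __name__ == "__main__":
    # Coordinates taken from the Gene Information table for Saro_2098.
    length = gene_length_bp(2234649, 2235851)
    print(length, protein_length_aa(length))  # prints "1203 400"
```

Under these assumptions the computed values agree with the Gene Length (1203 bp) and Protein Length (400 aa) fields. Passing the Gene sequence string to `gc_percent` gives a figure that can be compared with the GC content field.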