Gene Information |
Locus tag | Saro_3404 |
Symbol | |
ID | 5077553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 4890 |
End bp | 6392 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640481128 |
Product | hypothetical protein |
Protein accession | YP_001165790 |
Protein GI | 146275630 |
COG category | |
COG ID | |
TIGRFAM ID | |
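The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4890
end_bp = 6392
gene_length_bp = 1503
protein_length_aa = 500


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both computed values agree with the fields listed in the record.
print(computed_length == gene_length_bp)       # True
print(computed_protein == protein_length_aa)   # True
```

Under these assumptions, 6392 - 4890 + 1 gives 1503 bp, and 1503 / 3 - 1 gives 500 aa, matching the Gene Length and Protein Length fields.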
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.552289 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGATCC TCCCCCATAA GTCGTTCGCT CTCCCCCTCG CCGCGCTTGC CCTCGCTGCA TCGATGGGCG CGCGTGGCGG GGAAACCCCC GACCGCTGGT GGGCACCGGG CGAAGGCCGG GTGCTCCCGG CGGTCCTCGA TTACGAGAAC GACCACGGCA CGCTGCGCAC GCTTGTCGTG GGCGGTCCGC TGAAGACGAA AGGCCACCCG TTCTTCGAAC CGCTCGGCCC CAATGGCCGT GCTTGCGTGA CCTGCCACCA GCCCGCCGAT GCGATGAGCC TGTCGGTCGA AAGCGTCCGG CGCCAATGGG ATCGGACGAA GGGCAAGGAT CCGCTCTTCG CCGCAATCGA CGGTTCGAAC TGCCCCACCC TTCCACAAGA AGCGCGTGAT TCGCATTCGC TGCTGCTCGA TCGCGGGCTC TTCCGGATAG AGCGCCCGTG GCCGGTCACC AGCTTCAACG GCAGGCCGGT AACGCCCGAT TTCACCATAG ATGTGGTGCG CGATCCCAAC GGCTGCAACT CCGGCCCGGC CTATGGCCCC GCAGCCGGCA AGATCTCGGT CTATCGTCGC CCGCGCCCGG TCGCGAACAT GAAGTACCTG CTCGCCGTCG GCTTCCCCTA CGATCCCAAG CAGGGATACG CGCTCCCGCT CGATCCCGAT GACGGCAAGC CCCAGTCCGG AAACCTCATG GCCGACAACC GCGCGGGCAA CCTGCGCCTG CAGATGGAAG ATGCGGCCAG CAGCCACCTC CAGATGCTGA AGCGCCTGGG CCCCGCCCAG CGGAAGCGGC TTCAGGACTT CGAACTGCGC GTCTTCACCG CAATGCAGGT CAGCAAGACC GGCGGCGCGG TCGACACGCT CGGCGCCAAG GCAGGCCCCG CCCGCCTGCG CGACAGCCAG CCCGGCGCGC TCGGGTCCAT CGGCGAGCCA GTGTGGAGCG AATTTGCCGG CTGGGAGAAG ATTTCGCCGG ACGACGCGGC AAAGCTCACG CCCCGGCATC TCGCGTTCCG CCAGTCGGTC GCGCGCGGGG CAAGGGTGTT CCGCGACAAG ACCTTCCTCA TCACCGATAC TGCGGGCATC AACTCGCGGA TCGGCTTCGG CAACCCGGTG CGCAACTCCT GCGTATTCTG CCACAACATG AGCCAGATGG GGAACGATGT CGCCCCGGGC CAGGTGGACC TCGGCACGAC GACGCTGCCC TTTGCCGATC CGTGGGACGA CCTTCCGCTG TTCCGCATCA CCTGCACGGG CCGCCCGCAT CCGCACTATG GCCGCGTGAT CTACACCTAC GATCCGGGCT TCGCGCTGAC CTCGGGCAAG TGCGCGGATG TCGGCAAGAT CACGCTCCAG TCCATGCGCG GCCTTTCCGC GCGAGCCCCC TATTTCTCGA ACGGTCTGGC AAGGGACCTT CGCGGGATCG TCGACTACTA CGAGCGCCGC TATTCCATCG GCTATACCGA GCAGGAGAAG CAGGACCTCG TCAACCTGAT GAGCGTGCTG TGA
Protein sequence | MAILPHKSFA LPLAALALAA SMGARGGETP DRWWAPGEGR VLPAVLDYEN DHGTLRTLVV GGPLKTKGHP FFEPLGPNGR ACVTCHQPAD AMSLSVESVR RQWDRTKGKD PLFAAIDGSN CPTLPQEARD SHSLLLDRGL FRIERPWPVT SFNGRPVTPD FTIDVVRDPN GCNSGPAYGP AAGKISVYRR PRPVANMKYL LAVGFPYDPK QGYALPLDPD DGKPQSGNLM ADNRAGNLRL QMEDAASSHL QMLKRLGPAQ RKRLQDFELR VFTAMQVSKT GGAVDTLGAK AGPARLRDSQ PGALGSIGEP VWSEFAGWEK ISPDDAAKLT PRHLAFRQSV ARGARVFRDK TFLITDTAGI NSRIGFGNPV RNSCVFCHNM SQMGNDVAPG QVDLGTTTLP FADPWDDLPL FRITCTGRPH PHYGRVIYTY DPGFALTSGK CADVGKITLQ SMRGLSARAP YFSNGLARDL RGIVDYYERR YSIGYTEQEK QDLVNLMSVL
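The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a field could be handled, the code below strips the spacing, computes a GC fraction, and translates codons in frame. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits). Only the first 30 bases of the gene sequence are used here, so the GC value printed is for that prefix and not for the whole gene.

```python
# Standard codon assignments, built in TCAG order (shared by table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three ten-base blocks of the Gene sequence field above.
prefix = clean("ATGGCGATCC TCCCCCATAA GTCGTTCGCT")
print(len(prefix))         # 30
print(translate(prefix))   # MAILPHKSFA
print(round(gc_fraction(prefix), 3))
```

The translated prefix matches the first block of the Protein sequence field. Passing the full Gene sequence field through the same functions would be the natural way to check the complete protein and the reported GC content.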