Gene Information |
Locus tag | Saro_1286 |
Symbol | |
ID | 3917917 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 1328422 |
End bp | 1329735 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640444022 |
Product | capsule polysaccharide biosynthesis |
Protein accession | YP_496564 |
Protein GI | 87199307 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3563] Capsule polysaccharide export protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.395245 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTTCTC CATCACCAAG GCCATTGCTG GAAGTTCCCC GGTTCCCGGG CAGCAGCCCT GCATGCCTCC AGCAGCCCAG AGCCGGAGCA AGGCTGGACA TCGACGCGGC CGAACTGCGC CGCATGTGTG ATCGTCTCGG CCTGAGCGGC TCCGCATATC CTGAGGAGGA TCTGCGCCGC CGCTTTCTCG AAGGATGGGC ACTCGTCGAT CCCTTCGGCG GGCCGCCGCC CTCGCCCCGG ACCGCCATCG AACTGTTCGC GTCATGGCGC AGGCTTTTCA GCGCCAACCG GCAGATCGCC AGTATCCAGG GAATCGCGAG GTGGAAGCGC GCGGCGCTGG CCCCGCTTCT CTGGGACGGA AGACGCGACG TTCCATTCGA TCAGCCGCTG GCGCCCGGAG GAACGACCGC GATATGGCGC GCCCGCACGT CAACAAGGGC CCTGCGGGCA ATCGACGCCA GCGGCGGGCA GCGGCTGGAG ATCGAGGACG GCTTCATCCG CTCGGCCGGC CTTGGCGCGG ATTGCGTGCC GCCGCTGTCC ATCGTCGTCG AGCGGGACTT CGCGCACTAT GACCCGTCGG GGCCGAGCGG GGTCGAGCGG CTGGTTGCGA AGGGCGGCTT CGATGGCGAC CTGCTGGCGC GCGCCGCCCG GCTGCGGACG CGAATCGTTT CGCTGGGCAT CGGCAAGTAC GGCGCATCGA GCATGCGCTT CGCTCGGCCA GGCGGGGCGC GGCGGCACCT GCTGGTCATC GGACAGGTGG CCGACGATCT CTCGCTGCGC CTTGGCGGGG CCGGGCTGGA CAACATGGCG CTACTGCGCC GCGTCCGGCT TGCGGCGCCG GATGCGTTCA TTCTCTACCG CCCGCACCCG GACGTCACCG CCGGTCACCG CGCCGGGCAT GTTCCCGACG GAGAGGCGCT GGCCTTTGCC GACATGGTCG CGCGGGAGCC ACCAATCGCG GCGCTGATCG AAGCGGCCGA CGAAGTCCAT GCCATCACGT CGCTTGCGGG ATTCGAGGCA TTGCTGCGGG GCAAGCGCGT GGTGACCCAT GGCGTGCCGT TCTACGCAGG CTGGGGGCTG ACCACGGATC TCGGGCCCGT CCCTGCGCGC AGGATGGCAC GACGCAGCAT CGATGAGCTG GTTGCGGCCG CGCTGCTGCT GCATCCGCGT TACCTCGACC CGCTGACCCG GCTTCCCTGC CCGGTGGAAG TTGCGGTGGA GCGCGTTGCC GGAGGGGCCG GGATGGGCAG CGAACTGCTG GTCGGCCTGC GCCGGAGGTG GGGCTCTGTG CGGCGCGCGG CACGATGGAA CTGA
|
Protein sequence | MPSPSPRPLL EVPRFPGSSP ACLQQPRAGA RLDIDAAELR RMCDRLGLSG SAYPEEDLRR RFLEGWALVD PFGGPPPSPR TAIELFASWR RLFSANRQIA SIQGIARWKR AALAPLLWDG RRDVPFDQPL APGGTTAIWR ARTSTRALRA IDASGGQRLE IEDGFIRSAG LGADCVPPLS IVVERDFAHY DPSGPSGVER LVAKGGFDGD LLARAARLRT RIVSLGIGKY GASSMRFARP GGARRHLLVI GQVADDLSLR LGGAGLDNMA LLRRVRLAAP DAFILYRPHP DVTAGHRAGH VPDGEALAFA DMVAREPPIA ALIEAADEVH AITSLAGFEA LLRGKRVVTH GVPFYAGWGL TTDLGPVPAR RMARRSIDEL VAAALLLHPR YLDPLTRLPC PVEVAVERVA GGAGMGSELL VGLRRRWGSV RRAARWN
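The length fields in this record can be cross-checked against the coordinates and sequences. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence ends with a single stop codon (here TGA), which is not translated. The helper names (`gene_length`, `expected_protein_length`, `clean_sequence`, `gc_fraction`) are hypothetical and are not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the CDS is a whole number of codons and that the final
    codon is a stop codon, which does not contribute an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(raw: str) -> str:
    """Remove the whitespace that splits the sequence into 10-letter blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Values taken from the record above.
length_bp = gene_length(1328422, 1329735)
print(length_bp)                           # 1314
print(expected_protein_length(length_bp))  # 437
```

To check the GC content field, pass the full text of the Gene sequence field to `gc_fraction`; the block spacing is removed by `clean_sequence` before counting.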