Gene Saro_1286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1286 
Symbol 
ID3917917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1328422 
End bp1329735 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content70% 
IMG OID640444022 
Productcapsule polysaccharide biosynthesis 
Protein accessionYP_496564 
Protein GI87199307 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3563] Capsule polysaccharide export protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.395245 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTCTC CATCACCAAG GCCATTGCTG GAAGTTCCCC GGTTCCCGGG CAGCAGCCCT 
GCATGCCTCC AGCAGCCCAG AGCCGGAGCA AGGCTGGACA TCGACGCGGC CGAACTGCGC
CGCATGTGTG ATCGTCTCGG CCTGAGCGGC TCCGCATATC CTGAGGAGGA TCTGCGCCGC
CGCTTTCTCG AAGGATGGGC ACTCGTCGAT CCCTTCGGCG GGCCGCCGCC CTCGCCCCGG
ACCGCCATCG AACTGTTCGC GTCATGGCGC AGGCTTTTCA GCGCCAACCG GCAGATCGCC
AGTATCCAGG GAATCGCGAG GTGGAAGCGC GCGGCGCTGG CCCCGCTTCT CTGGGACGGA
AGACGCGACG TTCCATTCGA TCAGCCGCTG GCGCCCGGAG GAACGACCGC GATATGGCGC
GCCCGCACGT CAACAAGGGC CCTGCGGGCA ATCGACGCCA GCGGCGGGCA GCGGCTGGAG
ATCGAGGACG GCTTCATCCG CTCGGCCGGC CTTGGCGCGG ATTGCGTGCC GCCGCTGTCC
ATCGTCGTCG AGCGGGACTT CGCGCACTAT GACCCGTCGG GGCCGAGCGG GGTCGAGCGG
CTGGTTGCGA AGGGCGGCTT CGATGGCGAC CTGCTGGCGC GCGCCGCCCG GCTGCGGACG
CGAATCGTTT CGCTGGGCAT CGGCAAGTAC GGCGCATCGA GCATGCGCTT CGCTCGGCCA
GGCGGGGCGC GGCGGCACCT GCTGGTCATC GGACAGGTGG CCGACGATCT CTCGCTGCGC
CTTGGCGGGG CCGGGCTGGA CAACATGGCG CTACTGCGCC GCGTCCGGCT TGCGGCGCCG
GATGCGTTCA TTCTCTACCG CCCGCACCCG GACGTCACCG CCGGTCACCG CGCCGGGCAT
GTTCCCGACG GAGAGGCGCT GGCCTTTGCC GACATGGTCG CGCGGGAGCC ACCAATCGCG
GCGCTGATCG AAGCGGCCGA CGAAGTCCAT GCCATCACGT CGCTTGCGGG ATTCGAGGCA
TTGCTGCGGG GCAAGCGCGT GGTGACCCAT GGCGTGCCGT TCTACGCAGG CTGGGGGCTG
ACCACGGATC TCGGGCCCGT CCCTGCGCGC AGGATGGCAC GACGCAGCAT CGATGAGCTG
GTTGCGGCCG CGCTGCTGCT GCATCCGCGT TACCTCGACC CGCTGACCCG GCTTCCCTGC
CCGGTGGAAG TTGCGGTGGA GCGCGTTGCC GGAGGGGCCG GGATGGGCAG CGAACTGCTG
GTCGGCCTGC GCCGGAGGTG GGGCTCTGTG CGGCGCGCGG CACGATGGAA CTGA
 
Protein sequence
MPSPSPRPLL EVPRFPGSSP ACLQQPRAGA RLDIDAAELR RMCDRLGLSG SAYPEEDLRR 
RFLEGWALVD PFGGPPPSPR TAIELFASWR RLFSANRQIA SIQGIARWKR AALAPLLWDG
RRDVPFDQPL APGGTTAIWR ARTSTRALRA IDASGGQRLE IEDGFIRSAG LGADCVPPLS
IVVERDFAHY DPSGPSGVER LVAKGGFDGD LLARAARLRT RIVSLGIGKY GASSMRFARP
GGARRHLLVI GQVADDLSLR LGGAGLDNMA LLRRVRLAAP DAFILYRPHP DVTAGHRAGH
VPDGEALAFA DMVAREPPIA ALIEAADEVH AITSLAGFEA LLRGKRVVTH GVPFYAGWGL
TTDLGPVPAR RMARRSIDEL VAAALLLHPR YLDPLTRLPC PVEVAVERVA GGAGMGSELL
VGLRRRWGSV RRAARWN