Gene Saro_3179 details


Gene Information

Locus tag: Saro_3179
Symbol:
ID: 3918221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 3399971
End bp: 3401332
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 65%
IMG OID: 640445963
Product: O-antigen polymerase
Protein accession: YP_498448
Protein GI: 87201191
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3307] Lipid A core - O-antigen ligase and related enzymes
TIGRFAM ID: [TIGR03097] probable O-glycosylation ligase, exosortase system type 1-associated
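
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3399971
end_bp = 3401332
gene_length_bp = 1362
protein_length_aa = 453

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the last one is assumed to be a stop codon.
codons, leftover_bases = divmod(gene_length_bp, 3)

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a whole number of codons:", leftover_bases == 0)
print("codons minus stop matches protein length:", codons - 1 == protein_length_aa)
```

Under these assumptions all three checks agree with the record: the span is 1362 bp, which is 454 codons, or 453 amino acids plus one stop codon.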


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.579286
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTGACC TGTTCCTGCT GAGCTTCGTG CTGGCCTTCA TCGGCGCGGG CTTTCGCCGG 
CCCTTCATCT TCGTGCTGGC CTACTCGTAC ATCGACATCG TGGCCCCGCA GAAGGTGAGC
TGGGGCATCC TCAGCCACAT TCCGGTGTCG CTCATCGCAT TCCTCTGCGC GTTCATCTCG
TGGTTCGTGG CGGAGGACAA GAACGGCATC CGCTTCTCCA TGCGCCAGTT CCTGCTGCTG
GCCCTGCTGG TCTATTGCGG GCTGACGACG CAGACCGCCG ACTTCCCGGC AGAGGCCGCG
GACAAGTGGG CCTGGGTGTG GAAGGCGCTG CTGTGGGCCC TGTTCCTGCC GCTGACGCTG
CGCACGCGCC TGCGCATCGA GGCGATCACC CTGATCCTTG CCCTGTCCAT CGGCGTGATC
GTGATCGGCG GCGGCATCAA GACCGCGGCC GGTGGCGGCG GATACGGCGA ACTGCGCCTG
CTGGTGAACG ACAACACCGG CCTCTACGAG GGCTCGATCA TCTCGGCAGT CGCCATTGCG
GTCATTCCGC TGGCGCTCTG GCTTTCGCGC TTCGGCACGA TCTTCCCGCC CGACTGGAGG
GTGAAGACCT TCGCCTGGGC GCTTTGCTTT GCCTGCGCGC TCATGCCCAT CGGCACCGGG
GCGCGGACCG GCCTTGTCTG CGTGGTGGTG CTGGCCGCGA TGATCCTGCG CACGGCAAAG
CGGCGCTTGC TGATCGTGTC GGTCATGGCC GCAGGCGCCC TGATCGCGGT CCCGCTCCTG
CCCAAGGAGT TCACCGACCG CATGGGCACG ATCCGGAACC ACCAGTCCGA CCAGTCCGCC
GGAACCCGCA TCGCGGTGTG GAAGTGGACG ATAGAGTTCG CCAAGACCCA TCCCTTCGGC
GGCGGCTTCG AGGCATATCG CCAGAACCGG CTGGAATACG ACACGGTCAA GGCCGACTAT
GCCGGCGACA ACAACGCCGC GCTCGAATAC CAGCCCATTG TCGAAGAGGG GCGCGCCTAT
CATTCCAGCT ACTTCGAGAT GCTGGGCGAA CAGGGCTATC CGGGCCTGGC CCTGTGGCTG
GCGCTTCACC TGCTGGGCGT GTGGCAGATG GAACTGCTGA GGCGGCGCTA TCGCAAGGAG
GCATCGAAGG AGTTCCGCTG GGTCGCCCCG CTGGCCGAAG CCTTGCAGCA GGCCCAGGTG
ATCTACCTCG TCGGCTCAAC CTTCGTCGGC ATCGCGTTCC AGCCGTTCTG CTACATGCTG
GTGGGCCTCC AATGCGGGCT CTGGGCCTAT ATCAAGCGGG TCCGCACAGC CACGCCTGAG
CCGTTCCGCA AGGCTTCAAC CCCGGTGACG GCACCCGCTT AA
 
Protein sequence
MLDLFLLSFV LAFIGAGFRR PFIFVLAYSY IDIVAPQKVS WGILSHIPVS LIAFLCAFIS 
WFVAEDKNGI RFSMRQFLLL ALLVYCGLTT QTADFPAEAA DKWAWVWKAL LWALFLPLTL
RTRLRIEAIT LILALSIGVI VIGGGIKTAA GGGGYGELRL LVNDNTGLYE GSIISAVAIA
VIPLALWLSR FGTIFPPDWR VKTFAWALCF ACALMPIGTG ARTGLVCVVV LAAMILRTAK
RRLLIVSVMA AGALIAVPLL PKEFTDRMGT IRNHQSDQSA GTRIAVWKWT IEFAKTHPFG
GGFEAYRQNR LEYDTVKADY AGDNNAALEY QPIVEEGRAY HSSYFEMLGE QGYPGLALWL
ALHLLGVWQM ELLRRRYRKE ASKEFRWVAP LAEALQQAQV IYLVGSTFVG IAFQPFCYML
VGLQCGLWAY IKRVRTATPE PFRKASTPVT APA
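
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The sketch below is one possible way to strip the spacing, compute a GC fraction, and translate DNA with the standard codon assignments, which translation table 11 shares with table 1 for every codon (the tables differ only in which start codons are permitted). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first line of each sequence is used to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 assigns the same amino acid to every codon as table 1.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block):
    """Remove spaces and line breaks from a sequence block and uppercase it."""
    return "".join(block.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a non-empty DNA string."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon.

    Alternative start codons are not special-cased; this gene starts with ATG.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and first 20 residues of the protein, as listed above.
first_dna_line = "ATGCTTGACC TGTTCCTGCT GAGCTTCGTG CTGGCCTTCA TCGGCGCGGG CTTTCGCCGG"
first_protein_blocks = "MLDLFLLSFV LAFIGAGFRR"

print(translate(first_dna_line) == clean(first_protein_blocks))
print(round(gc_fraction(first_dna_line), 2))
```

Passing the full gene sequence to `gc_fraction` and `translate` in the same way would allow the listed GC content and protein sequence to be compared against the DNA.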