Gene Saro_0740 details


Gene Information

Locus tag: Saro_0740
Symbol:
ID: 3918564
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 782384
End bp: 784855
Gene Length: 2472 bp
Protein Length: 823 aa
Translation table: 11
GC content: 65%
IMG OID: 640443472
Product: glycosyl transferase, group 1
Protein accession: YP_496021
Protein GI: 87198764
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
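
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the final stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(782384, 784855)
print(length, protein_length(length))  # 2472 823
```

Both results match the Gene Length and Protein Length fields of the record.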


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGCGGC GCGGTGCAAC CCTATCGCCG GACGTGCGTT TGTTCATTGA TCGTACAGGT 
TCCGGCACCG GCCGACCTGC TCAGCATTCG GAACACCACA AAGCCATGAA ATCCAGCAGC
ACGACCTATC GACATGCCGA CGCCGTGCCT CCCGGTTCGG TTCTTTCGTT CCTGCGTCGC
GCCGACGCCG CGGTGTGTCG CCGCATTGCC CTGATCGGCG GCTTCCGTCC CCGCAAATGC
GGGATCGCGA CCTTCACGAC CGACATCTAC GAACAACTCG GCGCCCACCA TCCCGACGTG
GCGGTGGATC TTCATGTGGT GGACGATCCG CTCGCGGTTC ATGTCTACGA AGGCGTGCGA
GGCATCATCA GGTCGGACCG CCCCGAGGAC TATCGCGCCG CTGCGCGGCG GATCAACGAG
GATGCGGTGG ATGCGGTCTG GCTGCAGCAC GAGTACGGCA TCTTCGGCGG CGAGGCCGGG
GAAATGGTAC TCGAACTTGT CGACCGCATA GCTGCACCGC TGATCGTGAC GCTGCACACC
GTTCTGGCAG AACCCTCGGA CAAGCAGCGG GCGGTGCTGG AACACCTGCT ACGTCGCGCA
TCCGGCGTGA TGGTCATGTC CGATCATTCC CGGAAGCTGC TTCGGCAGAT CTATCGCGTT
GACGCCGACC GTATTGCCGT CATCGCGCAT GGCGCGCCGG ACCGCCCGTT TGGCCGTGAG
GAGGAGCACA AGCGTCAGTT CGGGCTGGAA GGGCGTCGCG TGATGATGAC GTTCGGCCTT
CTCGGTCCGG GCAAGGGGCT CGAAACCGTC ATCGAGGCGC TTCCCGCAAT TGCCGAGAAC
CACCCCGACG TGGTCTATCG CATCGTCGGC GCGACGCATC CCAACCTTGT CGCTCGCGAC
GGCGAGCGTT ATCGCGAAGG CCTCATGGAA CTTGCCGAAC GGCTGGGCGT TGCGGCGAAT
GTGCAGTGGG ATAATCGCTT CCTTGATACC GAAGAGCTGC TCGACCAGCT CGAGGCCTGC
GACATCTACC TCACGCCTTA CCCCAACATG CAGCAGTCGA CGTCAGGCAC GCTGAGCTAT
GCCGTTGCGC TCGGCAAGGC CGTGGTCTCG ACGCCGTATG TCCATGCGCG GGAGCTGCTT
GCCGAAGGCG TCGGCGTTCT GGTCGAGCCT CGCCAGGCGG ATGTCATCGC GGCCGCCGTC
AACCGGCTGC TAGACGATCC GCAGGAACTG CGCGCGGTCA AGCGGCGGGC ATGGGAAAAG
GGCCGAGAGA CGATCTGGCC TTGTTTTGCG AGGGGCGCGC GCGATCTGGT CGAGAGCGTA
GCGGTGCAGC CGTGTCGTCC CATGCCTCTG CTTGCGACGC CTGGCTTCGC GGGCGTTGCC
GCGATGGGCG ATGCGACCGG CATCATGCAG CATTCCATCG GAACCGTGCC CGACCGCCGA
CATGGCTATT GCCTGGACGA CAACGTCCGC GCCCTGATGC TCATGGGCGT TGCCGATGCC
GTGCCGGTTG CCGAGCGCCA GCGCTGGGCC ATGATCTATG CCTCGTTCAT CCAGCACGCC
TGGAATCCCG ACCGGCAGGC CTTCCGCAAT TTCATGAACT TCGACCGCAC TTGGTGCGAG
GACGTCGGCT CGGACGACAG CAACGGGCGC ACCGTCTGGG CCCTGGGCGG GGCAATGCGC
AACGCACCGG ACGAGGGCTT GAGGGCATGG GCGAGCCAGT GGTTCGACAT CGCGCTTGCC
CCGGTGGCTG CGATGGAGCC GCCCCGTACG GTCGCCTTCG TCATGCTCGG CTGCGCGCAG
GCCTTGAAGG CCAATCCCGA CCACGCTGCC GCACGGACCA GTCTGGAGCA TGGCGCTAAC
CTGCTTCACG CCCTTCTCGC GTCGTCGCGC AGGCCCGACT GGGCGTGGTT CGAGGCAGTG
CTGGGCTATG ACAACCCACG CCTGTCGCAG GCTCTGATCG AGGCGGGGAC GTTGCTTGGC
CGTGAGGACT GGCTGGAAAG CGGCTTGTCC AGCCTGCGCT TCATTTCGTG CCAGCAGGTG
TCGGCACAAG GTCACTTTCG CCCCGTGGGC TCCGAAACCT TCGGCCGGGC TTACGAGAAC
CTGCCCTTCG ACCAGCAACC TCTGGAAGCA TGGGCTGCGA TCGACGCTGC GACAGCAGCG
TTCGAGGCGA CGGGCGATGA GAACTGGCTG CATCATGCCG AAACGGCCTA TCGCTGGTTC
TTCGGCGGCA ACGATCGCGG GGTTGTGCTT GCAGACATCG CGAGCGGGCG ATGCCGGGAC
GGCGTCACGC CGCAGGGCGT GAATCTCAAT TGCGGTGCGG AGTCCATCCT CGCGTTCCAG
CTGGCGCATT ATTCGATCTG TGCCGTGGCC GGCCGGACTG CCGAAGGGCG TGCAATTCTG
GCTGGGCGCA CCAGCGAGAA GATTGCGGAA CGCATCAATA GCAGGGGTGC TCAACAGCTT
GCGGGTGCTT GA
 
Protein sequence
MLRRGATLSP DVRLFIDRTG SGTGRPAQHS EHHKAMKSSS TTYRHADAVP PGSVLSFLRR 
ADAAVCRRIA LIGGFRPRKC GIATFTTDIY EQLGAHHPDV AVDLHVVDDP LAVHVYEGVR
GIIRSDRPED YRAAARRINE DAVDAVWLQH EYGIFGGEAG EMVLELVDRI AAPLIVTLHT
VLAEPSDKQR AVLEHLLRRA SGVMVMSDHS RKLLRQIYRV DADRIAVIAH GAPDRPFGRE
EEHKRQFGLE GRRVMMTFGL LGPGKGLETV IEALPAIAEN HPDVVYRIVG ATHPNLVARD
GERYREGLME LAERLGVAAN VQWDNRFLDT EELLDQLEAC DIYLTPYPNM QQSTSGTLSY
AVALGKAVVS TPYVHARELL AEGVGVLVEP RQADVIAAAV NRLLDDPQEL RAVKRRAWEK
GRETIWPCFA RGARDLVESV AVQPCRPMPL LATPGFAGVA AMGDATGIMQ HSIGTVPDRR
HGYCLDDNVR ALMLMGVADA VPVAERQRWA MIYASFIQHA WNPDRQAFRN FMNFDRTWCE
DVGSDDSNGR TVWALGGAMR NAPDEGLRAW ASQWFDIALA PVAAMEPPRT VAFVMLGCAQ
ALKANPDHAA ARTSLEHGAN LLHALLASSR RPDWAWFEAV LGYDNPRLSQ ALIEAGTLLG
REDWLESGLS SLRFISCQQV SAQGHFRPVG SETFGRAYEN LPFDQQPLEA WAAIDAATAA
FEATGDENWL HHAETAYRWF FGGNDRGVVL ADIASGRCRD GVTPQGVNLN CGAESILAFQ
LAHYSICAVA GRTAEGRAIL AGRTSEKIAE RINSRGAQQL AGA
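
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the method used to generate this record: it assumes the sequence is pasted as the space-separated blocks shown above, and it builds the codon table from the amino-acid string of NCBI translation table 11, reading the start codon as an ordinary codon (which is adequate here because the gene starts with ATG). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids of NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text):
    """Join a sequence printed in space-separated blocks across lines."""
    return "".join(seq_text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
first_blocks = clean("ATGCTGCGGC GCGGTGCAAC CCTATCGCCG")
print(translate(first_blocks))  # MLRRGATLSP
```

Passing the full gene sequence through `clean` and then `translate` or `gc_percent` allows a comparison with the protein sequence and the GC content field listed in this record.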