Gene Saro_0750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0750 
Symbol 
ID3918574 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp794450 
End bp796771 
Gene Length2322 bp 
Protein Length773 aa 
Translation table11 
GC content61% 
IMG OID640443482 
Productglycosyl transferase, group 1 
Protein accessionYP_496031 
Protein GI87198774 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGATGG TTCTGCCCGA CATCCTGAAC CCGGACTACC AACGAGCGCG AGGGGACGCA 
GCCCGCGATG CCGGCGACTG GTTGGCCGCA GCGAAGTGTT ACTCCCGCTA CCTGCGCTTT
CGGCGGCGGG ATGCCGGGAT ATGGGTTCAG CTTGGCAATG TGCTCAAGGA ATGTGGCCGT
TTTCAGCAAA GCGAACGGGC ATACCTCCGG GCGATGCGGC TGGGGCTCAA TGATTCCGAT
CTGCATTTGC AGTTGGGGCA CCTGTCCAAG GTGCGCGGCC AGGCGGGTTC TGCCCGGCAC
CACTACCTGA CAGCCATTCG CCGCCAACCG ATGAGCGTCG ATGCCTTCGA GGAACTTGTG
AAGATGGGAC TTGAGGCTGA AGCCAACACG ATCCTCCTGA CCGAGCGGAA GACCGGGGAG
GAGCCGGGGG AGATGCTGAT CTTCGATGTG TCAGACCTGA TCTATTACGT CGGGCATCAC
GACAACCTTT CCGGCATACA GCGGGTCCAG TGCTGCATAA TCCAGGCGAT CGTGAAGTAT
GGGTTGAAAC CGCTCGATCA AATCGGTTTC ATTTCATTCA ACAGGCGATC GAACGGGTTC
ATGATGCTCG ACCGTACGAG GTTCCTCGCC TGGCTTGACG ATTTGTCGTT GCCGCCGGAG
CAGCGCTGCG TTCCGTTCGA TGTCGCCGAG GCGCGGGAGG GTCGAATTTT TCCGATGGGG
CCGCTGGCTG CGTTCATCCA GCCTGAGCGC ACCACCCTGG TCCTGCTGGG CGCCGCATGG
GTGACACCCG ATTACCCCAC CAAGATCGCC AATTTAAAGC GCTTGTTTTC AGCGCGCTTT
GTCATGGTGT TCCATGACTT CATACCGATC TTCGCAAAGG AGACTTGCGA CCAGGGGACG
GTCGAAGTCT TCAAGGAGTT CATCGATCAG ATACTGCCGA TCACGGACGT GGCGCTGTGT
GTTTCGCGGA ACACACGGGA TGACCTGCAC CGCTATTGCG CCAACGCTGG AATATGCGCG
CCGCCGGCAT TGGTGACGCG CCTCGGGTCC GGATTTCATG AATTCTTCCC CGAAGTTTCC
GACCGGTCTG CACTGCCCGT GCCGGCAGGC CGCAGGGGAA CGCCCTACGT CCTGTTCGTC
TCGACCATCG AAGGGCGCAA GAACCACCAA TACCTGTTCG ATGTCTGGGA CGAACTCGCC
AGGCGAGGCG TCGACACGCC CCGGATACTG TGTGTCGGCA GGCTCGGCTG GCGGGCTGAA
CCCGCCATCA TCAAGCTCAT TGAGACGGAC TATCTGGGCG GAAAGGTCGA AATCCGGGAG
GACGTCAGCG ATATCGAGCT CAAGGCCCTC TACGAAGGTG CGCTGTTCAC CGTCTACCCA
AGCATTTATG AGGGTTGGGG GCTCCCCGTC AGCGAAAGTC TCGCGAACGG CAAGGTCTGC
GTTCTGGCGG ATCGCACCTC TCTTCCGGAA GTCGCGGGAG AATTCGGGGT CTATGTTCCC
CTTGATGATC CTGCCGCCGC GGCAGACGTT GTTGCGGAGC TTTTGTCCCG CCCTGGCGAA
CTTGCGCGGC GCGAGGAGGC AATCCGCGAG AAATTCCAGC ATACGAGCTG GCGCATCGTG
GCCGAGGTGG CCTTGCAGGG CTGCACGCTG GCGCGAACCA ATCCCGTCCG GTCGGCGTTG
CCCACCGTCA GGGCCGCGGT CGAATACCCG GTTCGCAGTT TGCGGATGTC GACGCAGGGG
TTGATGGGTA GCGCAATGAT GGATGCGCTT GAGCAGGCGC ATGCCGCATT GATTTTACCC
GGTAGCGTCA AGTCGGCGCA CAAGGTCTCC GGGCTGCTGT GTCGCAACTC GGACTGGTAC
GCGCCTGAAG ATTGGGGAAG CTGGGCAAGG GCGCGAAAGG CTCGCGTCCA GTTCGCGGTC
GAAGCGTCCG AATTCGATGG CGAGGATGAA GTCCTCATCT ACCTGGCGCT GCGCTTCCTT
GAGCCGGCAT TGCCTGCGAC CGTGCGTATC ACCTTGAGTG GTTGCGGTCG ATCGCACCGC
CAACAGGTGC GTACGGAAGA GGCCATGATG GTCTGGCCGG TCCATGCTCG GGATTTTGGC
AGTGCCATCG CCTCAGACAG CGACAGGCTC GCCTTGGAAT TGCAGCTCGA AATCGTCGGG
ATGGACGCTT CTGTCGAGGC CGCGTGCCGC ACCATGGATT CGCGGGAACT GGTGTTCGGG
CTGCGCAGCT TCTGCATAGT GAAAGGCAGC GACACCGTGC AGCGGCTGAG AATTGCCGAA
AGGCAGGGCT ACCGGTCGAC AGTTGGCATG GGAGAGGCTT AA
 
Protein sequence
MPMVLPDILN PDYQRARGDA ARDAGDWLAA AKCYSRYLRF RRRDAGIWVQ LGNVLKECGR 
FQQSERAYLR AMRLGLNDSD LHLQLGHLSK VRGQAGSARH HYLTAIRRQP MSVDAFEELV
KMGLEAEANT ILLTERKTGE EPGEMLIFDV SDLIYYVGHH DNLSGIQRVQ CCIIQAIVKY
GLKPLDQIGF ISFNRRSNGF MMLDRTRFLA WLDDLSLPPE QRCVPFDVAE AREGRIFPMG
PLAAFIQPER TTLVLLGAAW VTPDYPTKIA NLKRLFSARF VMVFHDFIPI FAKETCDQGT
VEVFKEFIDQ ILPITDVALC VSRNTRDDLH RYCANAGICA PPALVTRLGS GFHEFFPEVS
DRSALPVPAG RRGTPYVLFV STIEGRKNHQ YLFDVWDELA RRGVDTPRIL CVGRLGWRAE
PAIIKLIETD YLGGKVEIRE DVSDIELKAL YEGALFTVYP SIYEGWGLPV SESLANGKVC
VLADRTSLPE VAGEFGVYVP LDDPAAAADV VAELLSRPGE LARREEAIRE KFQHTSWRIV
AEVALQGCTL ARTNPVRSAL PTVRAAVEYP VRSLRMSTQG LMGSAMMDAL EQAHAALILP
GSVKSAHKVS GLLCRNSDWY APEDWGSWAR ARKARVQFAV EASEFDGEDE VLIYLALRFL
EPALPATVRI TLSGCGRSHR QQVRTEEAMM VWPVHARDFG SAIASDSDRL ALELQLEIVG
MDASVEAACR TMDSRELVFG LRSFCIVKGS DTVQRLRIAE RQGYRSTVGM GEA