Gene Saro_0752 details

Gene Information

Locus tag: Saro_0752
Symbol:
ID: 3918576
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 797758
End bp: 799029
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 62%
IMG OID: 640443484
Product: glycosyl transferase, group 1
Protein accession: YP_496033
Protein GI: 87198776
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
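
The listed lengths can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 797758
end_bp = 799029

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1272
print(protein_length)  # 423
```

Both results agree with the Gene Length and Protein Length fields.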


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGGTCG CCATCGACGC ATTCAACATC GCGCTTGCGC ACGGGACCGG GGTTGCAACC 
TATGGGCGCA CCCTGGCGTC CGCTGCCGCC GGACTGGGGC ATGAGGTGAA CGTCCTGTTC
GGTGCCGGTG CCGGGTACTC GAAGGTTCCG CTCCTCAACG AGATTGCGCT TGCCGAAGTT
GGTGCGGTTT CAGCACGGGG CCCTTCACGC GCGGCATTGG CTCGCGGGAT CGCGCGCGGT
CTTGCCGGTG CGCGTCCGGC GCACGACTTG CCAATTTCTG GGCAGGTCAT TCTGCCTCCA
GGCCTCAGGC TGGATGCCAG GCGTCACTCG AACGTACCGA ACCTCTTCAA GGCCGCAGAT
CTCGGGTTTC GTGCGGCAGG CGCTTTCTCC AAGGTGAAGG TGCCCGGCAC CGATCTGGCG
CACTGGACGT ATCCGCTGCC GCTGAAAATG GTCGGTGCCC GCAACGTGTA TACGCTGCAC
GACCTTGTCC CGCTGCGTCT GCCCTACACC ACAGCGGATG TGAAACACGC CTATTACCGC
TTGTGCCGAC GGATCGCGCG CGACGCGGAT CATATCCTTA CCGTGTCGGA ATGCTCGCGC
CGCGACATCG TCCGGTTGCT GGGCGTCGAT GAAGATCGCG TTACCAACCT CTACCAGACG
AGCGATATCG CCGATGCCAT CGCTGGTGTG AGCGAAGACT TCGTTGCTCG CTACGTCGAG
GGTCTGCTTG GTGTCGGCAT GCGCGAATAC TTCCTCTTCT TCGGCGCAAT CGAACCAAAG
AAGAACGTGG CACGCCTGAT CGAGGCATTT TTGGCAAGCG CAGTGCAATC CCCGCTGGTG
ATCGTTGGCG GGGCGGGATG GGGCGGCGAA CAGGACGTGA AGCTGCTCAA ATCCTTGGCC
GGGATGGACA CGCGCAAGCG CATCGTCTGG CTGGGGTATC TCCCGCGTCA GATGCTGGCC
ATGTTGATCG CAGGCGCAAG AGCCACGGTG TTCCCGTCGC TCTACGAAGG CTTCGGGCTG
CCCGTTCTTG AATCGATGGA ACTGGGCACA CCTGTCATCA CGAGCAATGT CTCGTCCCTG
CCCGAAGTTG CCCAGGATGC GGGATTGCTT GTCGATCCGT ACGATGTTCG CTCGCTCGCG
ACGGCTTTCC TGCAACTGGA TGCCGACCCG GGCCTGCGCA GTCAAATGTC CATGCGTGGG
CGCGAAGTGG CTGCCGGCTT CAGTGCCGAA GCCTATCGGG GGCGGCTCGC GGACTTCTAT
GCCAAGTTCT GA
 
Protein sequence
MRVAIDAFNI ALAHGTGVAT YGRTLASAAA GLGHEVNVLF GAGAGYSKVP LLNEIALAEV 
GAVSARGPSR AALARGIARG LAGARPAHDL PISGQVILPP GLRLDARRHS NVPNLFKAAD
LGFRAAGAFS KVKVPGTDLA HWTYPLPLKM VGARNVYTLH DLVPLRLPYT TADVKHAYYR
LCRRIARDAD HILTVSECSR RDIVRLLGVD EDRVTNLYQT SDIADAIAGV SEDFVARYVE
GLLGVGMREY FLFFGAIEPK KNVARLIEAF LASAVQSPLV IVGGAGWGGE QDVKLLKSLA
GMDTRKRIVW LGYLPRQMLA MLIAGARATV FPSLYEGFGL PVLESMELGT PVITSNVSSL
PEVAQDAGLL VDPYDVRSLA TAFLQLDADP GLRSQMSMRG REVAAGFSAE AYRGRLADFY
AKF
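
The sequences above are laid out in blocks of ten characters separated by spaces. The following is a minimal sketch of how such a block-formatted sequence might be joined into a single string and its GC fraction computed. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used as a short demonstration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAGGGTCG CCATCGACGC ATTCAACATC GCGCTTGCGC ACGGGACCGG GGTTGCAACC"
last_line = "GCCAAGTTCT GA"

demo = clean_sequence(first_line + "\n" + last_line)
print(len(demo))            # 72
print(demo[:3], demo[-3:])  # ATG TGA
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` would give a value to compare with the GC content field.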