Gene Saro_3221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3221 
Symbol 
ID3917479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3439260 
End bp3440513 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content61% 
IMG OID640446005 
Productglycosyl transferase, group 1 
Protein accessionYP_498490 
Protein GI87201233 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0404695 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCGC CGGTTCCCGC AGCTGACTTC GAGTTCCCCA CCGAAATCCC CGGCCTGCCG 
CTGGCGGGCA GGAAAGTCCT GATCGTCGTC GAGAACCTGC CGCTGCCCTT CGACCGGCGT
GTCTGGCAGG AAGCGCGCAC ACTGAAGGCC GCCGGGGCGC AAGTCTCGAT CATTTGCCCC
ACGGGCAAAG GTTACGAGAA GCGCTTCGAA GTCATCGACG GCATCGACAT CCACCGCCAT
CCCCTGCCCA TCGAGGCGAG CGGCGCACTG GGGTTCCTGC TGGAATATGG CGCGGCCCTG
TTCTGGGAAA CGGTGCTGGC TTGGAAGATA TTCCTCAAGC GCGGCATCGA CGTGATCCAG
GGGTGCAATC CGCCCGACCT GATCTTCCTT GTCGCCCTGC CCTTCAAGCT TCTGGGCGTC
AAATATATCT TCGACCATCA CGACATCAAT CCCGAGCTTT ACGAAGCGAA GTTCGACAAG
CGGGGCTTCT TCTGGAAGTT GATGGTCCTG TTCGAAAAGT TGACGTTCAA GGCCGCCGAC
GTGTCGATGG CCACCAATCA TTCCTATCGC AAGATCGCCA TCGAGCGGGG CGGCATGGAC
CCGGATAAGG TGTTCGTCGT CCGCTCAGGT CCGGATCTCA GCCGGCTGAA GCGGGTACCT
CCGGTCGAAA GCTGGAAGAA CGGGCGCAAG CACCTCGTCG GATATGTCGG GGTGATGGGC
GACCAGGAGG GAATAGACCT TCTAATCGAT GCGGTGGACC ATATCGTGCG CGTGATGGGC
CGAGACGACA TCCAGTTCTG CCTTGTCGGC GGAGGGCCAA GCCTCGCCAA GCTAAAGGCA
CTGGTCGCGG AAAAGGGCTT GGCCGACTTC ATCCAGTTCA CCGGCCGCGC ACCCGATCAG
GACCTGTTCG AAGTTCTTTC GACGATGGAC GTCGGGGTCA ATCCGGACCG CGTCAACGCG
ATGAACGACA AGTCCACCAT GAACAAGATC ATGGAGTACA TGAGCCTCGA GAAGCCCATC
GTGCAGTTCG ACGTGACCGA GGGGCGCTTT TCCGCGCAGG AAGCCTCGCT CTATGCGCGC
GCGAACGATC CGGTCGACAT GGCGGAAAAG ATCGTCGAGC TGATCGGAGA TCCGGAACGA
CGGGCCCGCA TGGGCGCACT CGGCCGCATG CGCGTGGAGA CCGAACTGAA TTGGGGGCAC
CAGATCGCCC CGTTGATCGC CGCGTATCGC AAGGCGCTCT GCCTTGCCGA CTGA
 
Protein sequence
MNAPVPAADF EFPTEIPGLP LAGRKVLIVV ENLPLPFDRR VWQEARTLKA AGAQVSIICP 
TGKGYEKRFE VIDGIDIHRH PLPIEASGAL GFLLEYGAAL FWETVLAWKI FLKRGIDVIQ
GCNPPDLIFL VALPFKLLGV KYIFDHHDIN PELYEAKFDK RGFFWKLMVL FEKLTFKAAD
VSMATNHSYR KIAIERGGMD PDKVFVVRSG PDLSRLKRVP PVESWKNGRK HLVGYVGVMG
DQEGIDLLID AVDHIVRVMG RDDIQFCLVG GGPSLAKLKA LVAEKGLADF IQFTGRAPDQ
DLFEVLSTMD VGVNPDRVNA MNDKSTMNKI MEYMSLEKPI VQFDVTEGRF SAQEASLYAR
ANDPVDMAEK IVELIGDPER RARMGALGRM RVETELNWGH QIAPLIAAYR KALCLAD