Gene Saro_1626 details

Gene Information

Locus tag: Saro_1626
Symbol:
ID: 3918734
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1699641
End bp: 1701191
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 63%
IMG OID: 640444366
Product: glycosyl transferase, group 1
Protein accession: YP_496900
Protein GI: 87199643
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
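
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1699641
end_bp = 1701191
gene_length_bp = 1551
protein_length_aa = 516

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the last codon is assumed to be the stop codon.
codons = gene_length_bp // 3

print(span_bp == gene_length_bp)        # coordinates agree with the gene length
print(gene_length_bp % 3 == 0)          # length is a whole number of codons
print(codons - 1 == protein_length_aa)  # codons minus the stop codon give the protein length
```

Under these assumptions all three checks hold: 1701191 - 1699641 + 1 = 1551 bp, which is 517 codons, or 516 amino acids plus one stop codon.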


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0797094
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGGAC GCGAAAGCGA TGTCTGCGTC ATCGTCGAAG GCGCCTATCC CTATGTTACG 
GGCGGCGTTG CAAGCTGGCT TCAGGAATTG ATCACCAGCT TGCCGGAACT GACCTTTTCC
GTCGTGGCGA TCAAGGCCGA CGAGGAACCA CAGAAATGGA ACGTCGAACC GCCCCCAAAC
GTGATCGAGG TCGTGGAGGT CCCGCTGTCG TTCGCCCCGC GCAGACCGGC CGCGCTACCG
CCGAGCCTTG CCGACCGGAT AGGGCGCCTG CTCCTGCGTT TCCTTCAGGA GGGCCAGCCC
GAAATACTGC GAACCCTGGT GGCCGAGCTG GCCGCGCTCG ATCGCAAGCC GCATCCTGGC
GATGTCATGT CCAGCGCGCA GATGTTCTCG ATCCTCACGG AACATTACCG CGAAGCATTC
CCCTCCGCTT CGTTCCATCA TTTCTTCTGG GCAACCCGGA TCCTCTTGGG AGGCTTGCTC
GCCGTTCTTC TGGCGCCGCT GCCGAGGGCA CGCACCTACC ACACCCTGTC CACCGGGTTC
GCTGGCCTGC TTGCGGCGCG CGCGCGGCAT GAAACCGGAC GCCCCGCGTT CCTTACCGAG
CACGGCATCT ATCTGCTCGA GCGGCAGATC GAGATCATGA TGGCGGAATG GATGGGGGAT
CAGATCGACA ACGGACTGGC GCTTGAACGC GAACAGCATG ATCTGCGCGA CCTGTGGGCG
GCTGCGTTCG AAAGCTACGC CCGAGGGTGT TACGACGTCT GCCATCCGAT CATTGCGCTT
TACGGCGCCA ACAGCGAAGT CCAGGCGCGC ATGGGGGCTC GGCGCAAGAG CCTGCGGGTC
ATCCCCAACG GCATCCGGCC TGAACGCTTC GAGGGCGTAG TCTCCCGCCG TGACGAACAG
CGGCCGCTCA TTGCTCTCAT CGGCCGCGTC GTACCTATCA AGGACATCAA GACCTTCATT
CGTGCCGCAG GTCTTGTTCA CGCGGCATTC CCCGATGCGC GCTTTGCGGT GCTCGGCCCC
CGGGATGAAG ACGTGGATTA CGCTCTCGAC TGTACTGCGC TTGTCGACGA ACTCGGACTC
GGAGACGTGA TCGCGTTTCC CGGCCGGGTC AATGTGGTCG ACTGGATGCC GAAGATCGAC
ATACTCGTCC TGACCAGCCT ATCGGAAGCC CAGCCGCTGG TCATTCTCGA GGCAGGCGCA
TGCGGCATTC CATCGGTTGC GCCCGATGTT GGCAGTTGTC GCGAACTGAT CGAAGGCAAT
AAGCCGGGCG AACCCCATGG CGGTATCATC ACGGCTCTTG TCGATCCCGA GGCGACCGCC
GCAGCGCTTC TGCGTCTGCT GCGCCATCCT GATTTGCGCG CCTCGATGGG CGAGGTGATG
CGGGCGCGGG TCCATGCGGA TTACGACTGG TCGGGTATCG TGGAACAGTA TCGTCGTATC
TATTCGGGGA AGGAGGAGCC TCGCCAAATG GCGGTGGTCG GTTCGCCACC GGTCCCTCGT
GCGACCTTGC GCGATGTCGC GAACGCGATT CGCGGACCGT CGAAGGGGTG A
 
Protein sequence
MTGRESDVCV IVEGAYPYVT GGVASWLQEL ITSLPELTFS VVAIKADEEP QKWNVEPPPN 
VIEVVEVPLS FAPRRPAALP PSLADRIGRL LLRFLQEGQP EILRTLVAEL AALDRKPHPG
DVMSSAQMFS ILTEHYREAF PSASFHHFFW ATRILLGGLL AVLLAPLPRA RTYHTLSTGF
AGLLAARARH ETGRPAFLTE HGIYLLERQI EIMMAEWMGD QIDNGLALER EQHDLRDLWA
AAFESYARGC YDVCHPIIAL YGANSEVQAR MGARRKSLRV IPNGIRPERF EGVVSRRDEQ
RPLIALIGRV VPIKDIKTFI RAAGLVHAAF PDARFAVLGP RDEDVDYALD CTALVDELGL
GDVIAFPGRV NVVDWMPKID ILVLTSLSEA QPLVILEAGA CGIPSVAPDV GSCRELIEGN
KPGEPHGGII TALVDPEATA AALLRLLRHP DLRASMGEVM RARVHADYDW SGIVEQYRRI
YSGKEEPRQM AVVGSPPVPR ATLRDVANAI RGPSKG
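
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a listing could be cleaned, measured for GC content, and translated. The function names are hypothetical and not part of any database tool. The codon table is the standard amino acid assignment, which translation table 11 shares; table 11 also allows alternative start codons, which this sketch does not handle (the gene here starts with ATG, so that does not matter for this record).

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Return the percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate a cleaned DNA string codon by codon, stopping at a stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listed in this record.
first_line = "ATGACCGGAC GCGAAAGCGA TGTCTGCGTC ATCGTCGAAG GCGCCTATCC CTATGTTACG"
print(translate(clean_sequence(first_line)))  # MTGRESDVCVIVEGAYPYVT
```

The printed peptide matches the first twenty residues of the protein sequence above. Applying clean_sequence and gc_percent to the full gene sequence would allow the listed gene length and GC content to be checked in the same way.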