Gene Saro_0745 details

Gene Information

Locus tag: Saro_0745
Symbol:
ID: 3918569
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 789614
End bp: 790894
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 64%
IMG OID: 640443477
Product: glycosyl transferase, group 1
Protein accession: YP_496026
Protein GI: 87198769
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
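
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record; it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 789614
end_bp = 790894
gene_length_bp = 1281
protein_length_aa = 426

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length_bp // 3
expected_protein = codons - 1

print(span == gene_length_bp)                 # coordinates match gene length
print(gene_length_bp % 3 == 0)                # length is a whole number of codons
print(expected_protein == protein_length_aa)  # codons minus stop match protein length
```

Under these assumptions all three checks agree with the values listed in the record.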


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCTCGC GCGAGTTTTT GCTGGACGTC AGTCGTCTCG TCTGGCGCAA CTGGTCGCAT 
CGTCTGGCTA CCGGGATCGA CCGGGTCTGC TATGCCTATC TACGCAACTT CGGTCCTCGA
TCCCAGGCGG TCGTGCAGTA CCGCGGCGTT GCCCGCATCC TGACCGAGCG GCATTCCGAC
GAACTGTTCG AAACGCTGCT CGAGCCGGAC GAGGTGTTCC GGCGCAGGCT TTCCCTGCTG
GCGCCCCGCG CGCTTGCCGC CTCGGCTCGC GCTGTCGATG GCCGGGGTAC GTTCTACATC
AACGTCGGGC ACACGGACAT CGACCTGTCA CGCCTGCTCG GATGGACCAG GCGTTCGCGC
GTCAACCCGA TCTATCTGAT TCATGATCTG ATCCCTCTCA CGCATTCGGA GTTCTGCCGC
TCGGAAGCGA TCGAGCGTCA TCGTGGGCGC GTGGTCAATG CCCTGTTCTC TGCTGCGGGC
GTGATCGCCA ATTCACGTGC CACTGCTGCC GAGTTGGAGT TGTACGGCCG CAGTCACGGC
ATTCCGCTGC CGCCGATCAC GTCGGCATGG CTTGCGGGAG CGCGCCTTGC GCAGGACGAC
GTTGTCCCCC TCAAGAGCGG GCGGCACTTC GTCTGCGTGG GAACGATCGA GGGGCGCAAG
AACCACTTCA TGCTGCTCCA GGTCTGGCAG CGGCTGGTCG AACGTCTCGG ACCGGCCGCG
CCCAAGCTCG TTCTCATCGG CCAGAAGGGC GCCGAGGCCG CACACGTCGA GAGCATGCTC
GAACGCGGCC GGGGCATGAG CGACCACGTA GTGATCCTTT CGCATTGCCC CGACGAGGAA
CTGGGGCGCT GGATACGTAC CGCACGGGCG CTCTTGCTGC CTTCATTCGC CGAGGGGTTC
GGGCTCCCGG TGATCGAGGC CATGGAACTC GGCACTCCGG TCATTGCCAG CGATCTTCCC
TGTTTTCGGG AGATCGGCGT CGGCATTCCC ACTCTTCTCG ATCCGCTCGA CGCTGTCGCT
TGGGAGCGCA CGATCCTATC CTTCCTCGAC CTCTGCCCCG AGCGTGCGCG CCAATTGCGT
ATGCTGAAGG AATATAGCGC GCCGACGTGG GGCGGCCATT TCGCCCAGGT GGAGTCATGG
CTTGAGGAAC TGAGGACGGT ACGGCGTCCT TTCATGTTCC ATCCTGGCCC GGACAGACGG
TCCACGCCCG TGGTCGTTCG GCGTGACGGC GCAGAGCGTC ATGCCCCAGA CCGCCTGCAA
ACCTCCCGGC CCGAACGATG A
 
Protein sequence
MISREFLLDV SRLVWRNWSH RLATGIDRVC YAYLRNFGPR SQAVVQYRGV ARILTERHSD 
ELFETLLEPD EVFRRRLSLL APRALAASAR AVDGRGTFYI NVGHTDIDLS RLLGWTRRSR
VNPIYLIHDL IPLTHSEFCR SEAIERHRGR VVNALFSAAG VIANSRATAA ELELYGRSHG
IPLPPITSAW LAGARLAQDD VVPLKSGRHF VCVGTIEGRK NHFMLLQVWQ RLVERLGPAA
PKLVLIGQKG AEAAHVESML ERGRGMSDHV VILSHCPDEE LGRWIRTARA LLLPSFAEGF
GLPVIEAMEL GTPVIASDLP CFREIGVGIP TLLDPLDAVA WERTILSFLD LCPERARQLR
MLKEYSAPTW GGHFAQVESW LEELRTVRRP FMFHPGPDRR STPVVVRRDG AERHAPDRLQ
TSRPER
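
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. This is not the database's own pipeline; it is a minimal example that assumes the standard codon assignments (which translation table 11 shares with the standard code, differing only in permitted start codons) and that the blocks of ten bases are separated by whitespace. The helper names `clean`, `gc_percent`, and `translate` are hypothetical. Only the first 15 bases and the last line of the gene sequence are used here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the
# standard code; alternative start codons are not handled in this sketch.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_codons = "ATGATCTCGC GCGAG"        # first 15 bases of the gene sequence
last_line = "ACCTCCCGGC CCGAACGATG A"    # last line of the gene sequence (in frame)

print(translate(first_codons))  # MISRE
print(translate(last_line))     # TSRPER
```

The two outputs correspond to the first five and last six residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field.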