Gene Saro_0744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0744 
Symbol 
ID3918568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp788181 
End bp789335 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content62% 
IMG OID640443476 
Productglycosyl transferase, group 1 
Protein accessionYP_496025 
Protein GI87198768 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.348162 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAACAG TTTGTATTGA TTGCAGGTAT ATAGGTCCTA GGCCCAGCGG TATCGCAAAG 
GTCGAGGCGG CATTGGTCGA GTTTGCGCCG GAGTTGGCGC CTGAACTTGA ATTTCTTTTG
CTGAAGAGTC CTTCGGCGCC CCGGCGCCTC AGCCATGCGG CCAATGTGAC CGAAGTGGTG
GTCGGCGCAG CGGCCAATAG TCCCGCGACC ATGTGGTGGT TGCCGAGGAT CGTCGATCTC
TCAAGGGTCG ACTTGTTTCA TGCGACCTTC AACATCTTGC CGGCGGGCCT TGCCATGCCC
TGCGTGACGA CGGTCCACGA CATCATGTGG CTGACCCGCC AGGAATGGTG CAATGCTCGT
CTTTCGCGTC CGCTCGAACG CCGATTCTAC CGGCACGGCA TCGCACGCGC GCTCCGGAAT
TCGGCGGCGG TTGCCACTGT CAGTGAAGCG AGCCGCTCAG AAATTGCGAC GCATTTTCCG
GACGTTATTT CGCGTCTCCG CGTGACGTCG CCTGGCGTCG GGCCTGCCTA TAGCCCTGGG
CAGGTAACCA AGGAGCAGCT TGCGGGCCTG GGAATACCCG AGGGCCGCAA GGTGGTGCTC
ACCGTCGGTC AGTACGTGCC CTACAAGAAC CATGAAGGCG CATTGCGCAT CTTCGCCAAG
GCTTTCGCCG GTCGTGACGA TGTCGTGATG GTCTTCGTGC AGCGGTTGTC CCGCAACGCG
GAGCGACTGC GGGCGCAGGC GCGTCATCTT GGAATTGCCG ATCGCGTGCA TTTCCTCGGT
GCGCTCGACG ACGACGAACT CACTGCGTTT TACCGCAGCG CTTCGGTTTT GTTGCATCCC
TCGTTCTGCG AGGGCTTCGG CCTGCCCCTC GCCGAAGCGA TGGCCTGCGG TTGTCCCGTT
GTTGCCTCCG ATTGCTCCGC GATGCCCGAA GTGCTTGGCG ACGCGGGTAT GCTGGCGCCG
GTCAATGATG AGGGCGCTCT GGCGCAGGCA TTGCGGCGCG TGGTGGATGA CGCGGTCCTT
GCCCGACGCC TCGGTCGCGC AGGCATGGCC CGCGCCGCGA ACATGCGTTG GCGCGAATTC
GCGCGTGCGA ACGTGGACAT CTACCGCGAA GTGCTCAGGA ACGCTCAGCG GAGTTCCGAA
TTTGCGCGGC CTTAA
 
Protein sequence
MPTVCIDCRY IGPRPSGIAK VEAALVEFAP ELAPELEFLL LKSPSAPRRL SHAANVTEVV 
VGAAANSPAT MWWLPRIVDL SRVDLFHATF NILPAGLAMP CVTTVHDIMW LTRQEWCNAR
LSRPLERRFY RHGIARALRN SAAVATVSEA SRSEIATHFP DVISRLRVTS PGVGPAYSPG
QVTKEQLAGL GIPEGRKVVL TVGQYVPYKN HEGALRIFAK AFAGRDDVVM VFVQRLSRNA
ERLRAQARHL GIADRVHFLG ALDDDELTAF YRSASVLLHP SFCEGFGLPL AEAMACGCPV
VASDCSAMPE VLGDAGMLAP VNDEGALAQA LRRVVDDAVL ARRLGRAGMA RAANMRWREF
ARANVDIYRE VLRNAQRSSE FARP