Gene Saro_0729 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0729 
Symbol 
ID3918553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp770732 
End bp772492 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content64% 
IMG OID640443461 
Productglycosyl transferase family protein 
Protein accessionYP_496010 
Protein GI87198753 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.155064 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCGC GCAGCTTTCT GGGCGGGTTC TGCATCGCCT TTTTCCGCGA TCCCGTCCGG 
GCGCTGGTGG CAGCGTTCTG GTTCGCAACC GGCAAGCGAG TGCGGGCGCG CAATCGGCTG
CACCTGGTGC TGACCATGCC GGGGACGGCC TACGATCGGT GGATCGCGGA AGTGGAGGGG
GCGGATCTTC CCGAGGATCG CCCGGCCGAC CGGGCTTCGG GGCAGGCATC GCCGCGCTTC
AACGTGCTGA TCTGCGTGCC TGACGACGAC GCCGCCCGGA TTATCCGCCG ACAAATCGAA
GCGCTGATCG CACAGGGTTG GCCGTACTGG ACCGCGAACA TCTTCGTGGG CGCAGACGTG
ACCATGCCCG ACTTGCCGGA TGATCCCCGC ATTCGCTTGC TGCCTTCGAT GGGGGACGGC
GAGACCCTCA AGGAGACGTA CCCTGCCTTG GAGACAAAGG GCGGTTACGT CGTGCCCCAT
CACCAGGGCG CGATCCTTTC CGGCAGCGCC CTCCAGCGCA TGGCCGATGC CATTCGTGGA
AGCGGCAATC CCACGATCAT CTTTGGCGAC CACGACCATC TCGCGGGTCT GAACCGCAGG
CACACGCCAT GGTTCAAGCC GCAATGGAAC GCGGAAATGA TCCTCGCCCA GGACTATGTG
ACGCAAGTCC TGGCAATCCG TGAGGACGAG GCACAGGCGC TGCGGCTCGA CGGCGGGTCT
TGCGCTGCCT ATGCCCTTCT CCTTGAACTG AGCCGAACGC CGGGTTTCAC GGCAGTACGC
GTTCCCCATA TCCTTGCCCA CGTCGTCGAC GATCACGCTC CGGATGCTTC CGCCATCCGG
GCGATTGTCG CGCAGCACGT CGCGCATCGA GCCGGGATCG CAACGGCAGG CCCGTTCGGC
ACGGTCCGCG TCGCCTGGCC GTTGCCCGAT CCCTTGCCAC TGGTGAGCGT GATCGTGCCT
ACGCGGGACC AGCCGAGGCT TCTTCGTGCC TGCATGGACG GCCTTTTGCG CGACACGCTC
TACGCGCCCA TGGAAATCCT CGTGGTCGAC AACGGGACCA CCGATCGGCA AGCGCTGGCA
CTAATCCGCG AACACAGTGC CGACCCGCGA GTGAGGGTCC TGTCGGCACC GGGCCCCTAC
AATTATTCCA GGTTGAACAA CCGGGCGGTC CGAGAAGCTG CGGGAGAGTA CGTCTGCCTG
CTGAACAACG ATACGCAAGT CATCAAGGGG ACATGGCTGC ATGAGATGAT GCGGCAGGCA
TCCCGACCGG AAGCGGGCGC TGTAGGTGCC ATGCTGCTCT ATCCGGACCA CACGATCCAG
CATGCAGGCG TCGTGGTCGG AATGGGAGAG GCAGCAGGCC ATGCGCATCG CTTCCAAAGC
GCAGATGGCG CGGGCTTCTT CGCCCAGGCC CATGTCCAGA GATACGTGAG CGCCGTCACC
GCCGCCTGCC TCGTGGTCAA GCGCGAGAAG TTCCTCGCGG TCGACGGGCT GGACGAGGAG
GGGCTGCCGA TCGCCTTCAA CGACGTCGAC CTCTGCCTGA AGCTGCAGCG GGAGGGGTGG
CGCAACCTTT ATTGTCCGCA GGCGGTGATG GTACACCATG AATCGAAGTC CCGTGGCAAG
GATTTCGCGC CGGACCAGCG TGACCGCTAC ATGCGGGAGT TGTCGGTACT GCAAGGCCGG
TGGGGCACAG CCCGGTATCA GGACCCGCTG CACCACCCGC GCCTGAAGCG CTCCAGCGAA
ACCTATATCC TCGATTATTG A
 
Protein sequence
MSARSFLGGF CIAFFRDPVR ALVAAFWFAT GKRVRARNRL HLVLTMPGTA YDRWIAEVEG 
ADLPEDRPAD RASGQASPRF NVLICVPDDD AARIIRRQIE ALIAQGWPYW TANIFVGADV
TMPDLPDDPR IRLLPSMGDG ETLKETYPAL ETKGGYVVPH HQGAILSGSA LQRMADAIRG
SGNPTIIFGD HDHLAGLNRR HTPWFKPQWN AEMILAQDYV TQVLAIREDE AQALRLDGGS
CAAYALLLEL SRTPGFTAVR VPHILAHVVD DHAPDASAIR AIVAQHVAHR AGIATAGPFG
TVRVAWPLPD PLPLVSVIVP TRDQPRLLRA CMDGLLRDTL YAPMEILVVD NGTTDRQALA
LIREHSADPR VRVLSAPGPY NYSRLNNRAV REAAGEYVCL LNNDTQVIKG TWLHEMMRQA
SRPEAGAVGA MLLYPDHTIQ HAGVVVGMGE AAGHAHRFQS ADGAGFFAQA HVQRYVSAVT
AACLVVKREK FLAVDGLDEE GLPIAFNDVD LCLKLQREGW RNLYCPQAVM VHHESKSRGK
DFAPDQRDRY MRELSVLQGR WGTARYQDPL HHPRLKRSSE TYILDY