Gene Saro_2071 details


Gene Information

Locus tag: Saro_2071
Symbol:
ID: 3917718
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2210563
End bp: 2211738
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 70%
IMG OID: 640444823
Product: glycosyl transferase, group 1
Protein accession: YP_497344
Protein GI: 87200087
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
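
The start, end, and length fields above agree with each other if the coordinates are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal Python sketch of the arithmetic, using hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: one residue per codon, minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2210563, 2211738)
print(length)                  # 1176
print(protein_length(length))  # 391
```

Both results match the Gene Length and Protein Length values listed in the record.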


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.637859
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAGCGAC TGCTCTCGAT CTCGACGCTC TACCCCGCGC CCGGTCGCAC CGGCTTCGGG 
CGCTTCGTGG CGCGACAGAT GGAGGCGCTG GCGGCGCGCG GGGACTGGCA GGTGACGGTG
ATCAACCCCA TCGGCCTACC GCCGCTGCCG ATCAGGCGCT ACGCCGCCTT GCGCGCGATA
CCGGCGCAGG AACAGCAGGG CGGCGTGACC GTGCATCACC CGCGTTTCAC GCTTGTCCCG
GGTCTTTCGG GGCCGATCAA TCCCGCGCTG ATCGCCCGGG CGGTCGTGCC GCTGGCAAGG
CAGCTACATG CGCAAACGCC ATTCGACATG GTGGACGCGC AGTTCTTCTA TCCCGATGGC
CCGGCAGCGG CGAAAGTCGC GGCGGCACTC GACCTGCCCT TCGCGATCAA GGCACGCGGA
TCCGACATTC ACCTGTGGGG CGAGCGACGG CTTGCGGTGG CACAGATGCG GCGGGCGGCG
GCCGGGGCTT CGGCCCTGCT GTCCGTATCC GCCGCGCTGG CGCGCGACAT GGCCGCGCTC
GGTATGCCGG ATGACCGCAT CCGCGTGCAC TACACCGGGC TAGACGGCAG CCGCTTCCGC
TTGCAGGACC AGGCGCAGGC GCGCCGGGTG GTGGCGCATC TGGTGCCCGG CGACGGCAGG
CTGCTCCTCT GCGTCGGCGC GCTGCTCGCG ATCAAGGGAC AGGATCTGGC GATCCGTGCG
CTTGCCCTTT TGCCGCCGGA CGTGCGTCTC GCGCTTGCGG GAACGGGGCC GGATGATGCG
GCGTTGCGCG CTCTCGTCGC CGAACTCGGT CTCGAACACC GCGTGCATTT CCTCGGCGCG
GTGGAGCACG ACGCCCTGCC CGCGCTGCTT GCTGCAGCCG ACGCGATGGT GCTGCCGTCC
GAGCGCGAAG GCCTTGCCAA TGCCTGGATC GAAGCGCTCG CCTGCGGCGC GCCGCTGGTA
ATTCCCGACG TCGGCGGTGC GCGCGAAGTT GTTCGCGGAA CCAGCGCCGG CCGTGTCGTG
GCGCGCAATC CCGGGGCGAT CGCACAGGCC ATCTTGGACC TGCTAGCCGC CCCGCCCGCA
CGCGATGCCG TCGCGGCGAA TGTCGCGAGC TTCAGTTGGG ACGCGAATGC CGCGGCGCTT
GCAGCGATCT ACGAAGAAGC GGCGACGAAG CCTTAA
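
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. As a minimal sketch of how a GC content figure such as the one in the record can be computed from the pasted text (the function name `gc_percent` is hypothetical, and the rounding used by the source database is not known):

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring the spaces and line breaks of the listing."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in seq)
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence only: 11 of 20 bases are G or C.
print(gc_percent("GTGAAGCGAC TGCTCTCGAT"))  # 55.0
```

Passing the full sequence text to the same function gives the value for the whole gene.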
 
Protein sequence
MKRLLSISTL YPAPGRTGFG RFVARQMEAL AARGDWQVTV INPIGLPPLP IRRYAALRAI 
PAQEQQGGVT VHHPRFTLVP GLSGPINPAL IARAVVPLAR QLHAQTPFDM VDAQFFYPDG
PAAAKVAAAL DLPFAIKARG SDIHLWGERR LAVAQMRRAA AGASALLSVS AALARDMAAL
GMPDDRIRVH YTGLDGSRFR LQDQAQARRV VAHLVPGDGR LLLCVGALLA IKGQDLAIRA
LALLPPDVRL ALAGTGPDDA ALRALVAELG LEHRVHFLGA VEHDALPALL AAADAMVLPS
EREGLANAWI EALACGAPLV IPDVGGAREV VRGTSAGRVV ARNPGAIAQA ILDLLAAPPA
RDAVAANVAS FSWDANAAAL AAIYEEAATK P
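
The protein sequence begins with M although the gene begins with GTG, which normally encodes valine. This is expected under translation table 11, where GTG can act as a start codon and an initiating codon is read as methionine. The sketch below illustrates this on fragments of the sequence above. The names are hypothetical, and the start codon set is an assumed subset (ATG, GTG, TTG) of those that table 11 permits.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: a common subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str, is_start: bool = True) -> str:
    """Translate an in-frame DNA fragment, stopping at the first stop codon.

    If is_start is True and the first codon is a start codon, it is read as M.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and is_start and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene: the initial GTG is read as M.
print(translate_cds("GTGAAGCGAC TGCTCTCGAT CTCGACGCTC"))  # MKRLLSISTL
# Last line of the gene, which starts on a codon boundary and ends in the TAA stop.
print(translate_cds("GCAGCGATCT ACGAAGAAGC GGCGACGAAG CCTTAA", is_start=False))  # AAIYEEAATKP
```

Both outputs match the first and last residues of the protein sequence listed above.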