Gene Saro_2628 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2628 
Symbol 
ID3917061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2864556 
End bp2865608 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content65% 
IMG OID640445405 
Productglycosyl transferase family protein 
Protein accessionYP_497898 
Protein GI87200641 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGGCC GTCCGCCCGC CCTGTCCCGC ATCCCCGAGC ACCCCCGCGT CGCCGTCATC 
GTGCCAGCAT ATGGCGTGGC TCACCTTGTC GGCGAGGCGT TGCGATCTCT CCAGCGCCAG
ACTCTGGAAG AGTGGGAATG CGTGGTGATC GACGACGGGG CACCCGACGA TGTTACCGCA
GCCGTAGCTC CATTCCTCGA TGACCGCCGC ATCCGCTTCC TCGCCACGCC GAACGGCGGC
GTGTCTGCGG CACGGAACCG GGCCATCGCC GCATCTTCGG CACCCCTGAT CGCCTTGCTC
GACGGGGACG ACCTTTTCCG TCCTTCATAT CTCGAAACGA TGGTAGCGGT TCTGGAGGCT
GACGCGGAAG CACGGCTCGC AACCTGCAAC GCCCGAATCT TTGGTGCGGT CGCGCGCGAG
CGCACATGCG TGGAGCGCCG CCAGGGCAGC GGCGACGGCA CGAAGGGCTC ACTCGCCGAC
GTGCTCGATC GTTCCTTCAA CGTCTATATC GGGACGACCT TTCGCCGGGC AGACTTCGAG
CGGGTCGGCG GCTTCGACAC GACCATGGCG CAATCCGAAG ATTTCGATCT GTGGGTCAGG
CTGATGATGC TGGGCGGACA CGCGCACTAT GTCGATGCGG TTCTCGGCGA TTACCGCGTA
CGCCCTGGCT CGGCTTCCAG CAACGCGGGC AGGATGCTTC TCGGCAACAT CAAGGTATAC
GAGAAGGCCC GCTCACTCCT TGCACCGGAC CGACCGGAGC GTGAGCTGAT CGAACGCCTC
ATCGCCGATA ATCGCGCTTC CCTGGATTTC GAGCACGCGA TGGATCGCAT CATCGACGGG
GACGCGCGAA AGGGAATCGC GGAGCTGAAG AAATCGGTGG CAGCAGGCCA GATGGTCGGC
GGCCCGGTCT GGCGCCTCGC GTTTCTCGTC TGGCAACTAT TCCCCTCCCT GGCGAGGCCG
ATGCTGCGTT GGCGGAGACG GGCGCACAGT CGCGGCGGTT CAGGCGTGGG CGGATCGGCC
ATGTTCACCA GCTTCGTGGA GATCGAGGGG TGA
 
Protein sequence
MNGRPPALSR IPEHPRVAVI VPAYGVAHLV GEALRSLQRQ TLEEWECVVI DDGAPDDVTA 
AVAPFLDDRR IRFLATPNGG VSAARNRAIA ASSAPLIALL DGDDLFRPSY LETMVAVLEA
DAEARLATCN ARIFGAVARE RTCVERRQGS GDGTKGSLAD VLDRSFNVYI GTTFRRADFE
RVGGFDTTMA QSEDFDLWVR LMMLGGHAHY VDAVLGDYRV RPGSASSNAG RMLLGNIKVY
EKARSLLAPD RPERELIERL IADNRASLDF EHAMDRIIDG DARKGIAELK KSVAAGQMVG
GPVWRLAFLV WQLFPSLARP MLRWRRRAHS RGGSGVGGSA MFTSFVEIEG