Gene Saro_1008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1008 
Symbol 
ID3915790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1050070 
End bp1051209 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content64% 
IMG OID640443742 
Producthypothetical protein 
Protein accessionYP_496287 
Protein GI87199030 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2152] Predicted glycosylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCAGT TCCCGTTCGA CCGCCTGGTC TTCACCCCCT TCATGGTCGA CCTGGAGCGT 
TCTCCGCTGC GGGGGCATTT CGGCGAGGAA ACCTATGTCC TGGGTGCGTT CAATCCTGGG
ATGACGGTCC TGCCGAACGG CAACCTCGTG TTCATGGTGC GCATTGCCGA GGCGCTTCGC
CAGCCGATCC GCGACGGCAA GGTCCACGCG ATCCGGTGGG AAGACGGCGG CTACGTCCTC
GACGGCTGGC CGCTTGAACT GGTCGACACT TCCGATCCGC GCAAGTTCCT GCTCCACGGC
GGCGGCTGGA AGATCATGGC GCTGACCTCG TTGTCATGGC TGCTGCCGGT CGAAATGTCG
CCCGACGGGC TCGACGTGAT ATCCATCCAC TATGACAAGG CCATCGCGCC GCAAGGTTCG
CATCAGTGCT ATGGCATCGA GGACGCGCGC ATCTCGCGCA TGGGCGAGGG TGCCTATCTG
ATGACCACCT GTTCGGTCAG CCCCGAGCGC CATTCGACGA CGCTCTACTC CTCGGACAAC
GGGCTCGACT GGACATTCGA GGGCATCGTC CTCGATCACC AGAACAAGGA CATGTTGATC
TTCGAAGGCC TGATCCACGG TGAATACTGG GCTCAGACGC GCCCGCTCGG AGACCTCTAT
TTCGCCTACC CGCCGGGCAG CGAATGGCGC TCCGGCCCGT CGATAAATCT GTCGACCTCG
CCCGATGCCC TTCACTGGAA GCCCTGCCTC AAGCCTGGCA TCCGGCCCCA CGCCGGCACG
GCGGCAACCG CGCGCATGGG CGGCGGCACG CCGCCGATCC TCACCGAGAT CGACGGCAGG
CGCGGCTGGC TGAGCCTGTG GCACGGGGTG GAGCCCAAGG AGATCGTCGG CATCTATCGC
ACCTACTGGT CGCTGCTCGA TCCGGACGAT CCGTCGATCG CCATAGCCGC AAGTCATGCG
CCGCTGCTTG AACCGGACGC GGAACTGACC CGCCCGCTTG AAGACCTGCT TTACCTGCGC
GACGTGGTGT TCACCACCGG CATCGCGGAA GTCGGTGATC GCTTCATCGT GGCCTCGGGC
GAGGCCGATC TTGCCTGCCG CATCACCCAT GTGCCGAAGG AAGCCTTCCG TTCCGCGTGA
 
Protein sequence
MTQFPFDRLV FTPFMVDLER SPLRGHFGEE TYVLGAFNPG MTVLPNGNLV FMVRIAEALR 
QPIRDGKVHA IRWEDGGYVL DGWPLELVDT SDPRKFLLHG GGWKIMALTS LSWLLPVEMS
PDGLDVISIH YDKAIAPQGS HQCYGIEDAR ISRMGEGAYL MTTCSVSPER HSTTLYSSDN
GLDWTFEGIV LDHQNKDMLI FEGLIHGEYW AQTRPLGDLY FAYPPGSEWR SGPSINLSTS
PDALHWKPCL KPGIRPHAGT AATARMGGGT PPILTEIDGR RGWLSLWHGV EPKEIVGIYR
TYWSLLDPDD PSIAIAASHA PLLEPDAELT RPLEDLLYLR DVVFTTGIAE VGDRFIVASG
EADLACRITH VPKEAFRSA