Gene Saro_0079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0079 
Symbol 
ID3918510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp81777 
End bp82778 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content70% 
IMG OID640442804 
Producthypothetical protein 
Protein accessionYP_495362 
Protein GI87198105 
COG category[S] Function unknown 
COG ID[COG1426] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGCCGAAG CCGGCGGTCT TAACGACATC AGCAATATCG ACACTACCCC GTTCCCCGCC 
GGCGAGCCAC GCCGTGGCGG CGTGGGCGAG ACCTTGCGCG CCGCGCGCGA GGCGGCGGGT
CTGGACATCA AGCAGCTTTC GCTGAGGACG CGCGTCACGA CGCGCCATCT CGAAGCGCTC
GAAAGTGGCG ACTATTCGGT CCTGCCGGGC CGCCCCTACG CGCTCGGCTT TGCCAAGAGC
TATGCCCGCG CGGTGGGCCT TGACGACAAG GCCATCGGGG AGGCCGTCCG TGCCGAACTG
AACCGGCAGG CGCCGCCGCC GCCGCCGCGC GTCATCAACC AGTTCGAGGT GGGCGATCCG
ATCAAGACGC CCTCGCGCCT GACAGGCTGG CTTGCTGCCG GTCTGGTCGT GGCGATTGCG
GCTGCGGGCC TCACCTTGTG GCGCAGCTAT TACCTGCCGT CGGCGGAACT GCCCCCGCTG
GTCGGTGCCG AGGAAGCCAG TCCCGCGCCC TCGCAGGTCG CGGTTGTCCC GCTGCCCAGC
GCCGCGCCTT CGGGCCCGGT GGTCTTCACC GCCCGCGAGA ACGGGGTCTG GGTCAAGTTC
TACGACGGTC AGGGCCAGCA GATCCTCCAG AAGGAACTCG CCAAGGGCGA GACCTTCACC
GTGCCCTCTG GCGCACAGAA TCCGATGCTC TGGACAGGGC GGCCCGATGC GCTTGACATC
ACCGTCGGCG GGCAGGCCGT ACCGCGTATC GCCGAACGCG AAGGCATCGT GAAGGACGTG
CCGGTCAGCG CCGCCGCACT CATGGCGCGT GGCACCACGC CTGCGCCCGC CGCCGTCTCG
GCGGGGGCAG AGCAGACCTC GCAAGTGGCG CCATCCGCGC CTCGCCCGCG TCCGGCCGTT
GCGCGTCGTC CGGTGGTGGC GCAGCCGTCC GCCTCACCCG TTTCGGATCT TCGCCCCGCG
GAAAGCACGG AAACGGTTGC GCCTTCCACC GGAATGAATT GA
 
Protein sequence
MAEAGGLNDI SNIDTTPFPA GEPRRGGVGE TLRAAREAAG LDIKQLSLRT RVTTRHLEAL 
ESGDYSVLPG RPYALGFAKS YARAVGLDDK AIGEAVRAEL NRQAPPPPPR VINQFEVGDP
IKTPSRLTGW LAAGLVVAIA AAGLTLWRSY YLPSAELPPL VGAEEASPAP SQVAVVPLPS
AAPSGPVVFT ARENGVWVKF YDGQGQQILQ KELAKGETFT VPSGAQNPML WTGRPDALDI
TVGGQAVPRI AEREGIVKDV PVSAAALMAR GTTPAPAAVS AGAEQTSQVA PSAPRPRPAV
ARRPVVAQPS ASPVSDLRPA ESTETVAPST GMN