Gene Saro_2202 details

Gene Information

Locus tag: Saro_2202
Symbol:
ID: 3918868
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2342460
End bp: 2343707
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 61%
IMG OID: 640444957
Product: hypothetical protein
Protein accession: YP_497474
Protein GI: 87200217
COG category: [R] General function prediction only
COG ID: [COG0546] Predicted phosphatases
TIGRFAM ID:
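
The Start bp, End bp, Gene Length, and Protein Length fields are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 2342460
END_BP = 2343707


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1248, matches "Gene Length"
print(protein_length(length_bp))  # 415, matches "Protein Length"
```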



Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000618682
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCTCAAGG GGGCGATCTT TTCGCTGCAC GATGTGCTCG TGACCAAGGG CACGATCAAC 
GCGCCGCTGT TTGAGGAAAC GCTGCGGCTG CTGCGCTATC TCAAGGCGCG GGGTGTCGAA
CCGGTTTTCA TTGGCAACCA TGACTGGACG GTCACCAGTC CCGGCCAGTC GAAGCCGTTC
CGGACCCTGC TCGAAGAGCG GCTCGGTCCG ATCAGCTATT ATATCGGCGG CCAAAACGGG
ATGCCTTATA AGCCACGCGC CGATTCCACC GCGCATATCC TCTCCGACAA GGGCTGGCAG
CGGAATGAGG TCCTTTATGT CGGCAACACA ACCGACGATA TGAAGACCGC TGCCAATGGC
GGCCTGATGT TTGTGAACGT CATGTGGCAC GGAGTGGCGA GCCCCTACGG CTTTCAATTC
GACTCTCCAC GCGACGTCGC GCGCTTCGTC GATTGCCTCT GTCTGGGCCT CGACGGCTGG
TTCTGGGCGC TCGAACAGAG CGATCTGCGG GTCTATGCGC TCGCGCCTTT CACAACGCTC
TCGCCGCGCT ACGCACAAGC GCATGCCTAT TCTGAAAACG CCAAGGCGAC CTCGAAACAC
GGTGCCGGTG ATGCGAATTT CTGGGGCCGT CTGCTCGCGG CGCGCATCTA TTTTTCAGGT
CTCGCTGACG AGATCGACTA TATCACCGCC TATCCCGGGC ACGCGCCTAC TTCCAACGCG
ACGGTGATCA GTGAGGCGCT TAACATCCTG GGGCAGTCAC TGCGCAAGAG CTATCTGCCC
GACCTCATTC TCCGTCACAC CAAAGCGGTG AAATCGCAGA CCGCGCGGGC CTCAGGGGGA
AGCGTGGGCC TCGACAATCA GCTCAACACG ATCCGGCTCA ACCCGGCACC CGTCCGCGGC
GTGGGCGGCA AACCCTATAA GTCGCCGCCC GCGCGCGGCG GCAAGCGTGT CCTCGTTATC
GATGATATCT GCACCGAGGG TAACAGCTTC GAGGGTGCGC GGGCCTATCT GAGGGCCGCA
GGAGCGCAAA CGGTCTGCGT GAGCTGGCTC AAGACGATCA ATAAGGACTA TCGCGCCGTG
TCACCAGCCT TCGGCCCGTT CAATCCCTAC ATCGCGCAAA CCTTCCCGAC ACCGATCGCG
ACCACAACGC ACTGGTATTC GAGCGCGATC AGCTCGCATG CTGCGCCGAC TGACCTCGCC
GACGTCTATA ATCGCTACTT CAGCTGGGCT TGGCCCGCCG ATATATGA
 
Protein sequence
MLKGAIFSLH DVLVTKGTIN APLFEETLRL LRYLKARGVE PVFIGNHDWT VTSPGQSKPF 
RTLLEERLGP ISYYIGGQNG MPYKPRADST AHILSDKGWQ RNEVLYVGNT TDDMKTAANG
GLMFVNVMWH GVASPYGFQF DSPRDVARFV DCLCLGLDGW FWALEQSDLR VYALAPFTTL
SPRYAQAHAY SENAKATSKH GAGDANFWGR LLAARIYFSG LADEIDYITA YPGHAPTSNA
TVISEALNIL GQSLRKSYLP DLILRHTKAV KSQTARASGG SVGLDNQLNT IRLNPAPVRG
VGGKPYKSPP ARGGKRVLVI DDICTEGNSF EGARAYLRAA GAQTVCVSWL KTINKDYRAV
SPAFGPFNPY IAQTFPTPIA TTTHWYSSAI SSHAAPTDLA DVYNRYFSWA WPADI
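
The sequences above are printed in space-separated blocks of 10 characters, 60 per line, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup plus a GC-fraction helper; `clean_sequence` and `gc_fraction` are hypothetical names introduced here for illustration, not part of any database tooling.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments drops spaces and line breaks alike.
    return "".join(block_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence listing, copied from the record above.
first_line = "GTGCTCAAGG GGGCGATCTT TTCGCTGCAC GATGTGCTCG TGACCAAGGG CACGATCAAC"
dna = clean_sequence(first_line)
print(len(dna))  # 60
print(round(gc_fraction(dna), 2))
```

Applying `clean_sequence` to the full gene listing should give a string whose length equals the reported Gene Length, and `gc_fraction` on that string can be compared with the reported GC content.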