Gene Saro_2168 details

Gene Information

Locus tag: Saro_2168
Symbol:
ID: 3918833
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2305885
End bp: 2307321
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 63%
IMG OID: 640444923
Product: hypothetical protein
Protein accession: YP_497441
Protein GI: 87200184
COG category:
COG ID:
TIGRFAM ID:
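
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive, 1-based coordinates and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2305885
end_bp = 2307321
gene_length_bp = 1437
protein_length_aa = 478

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the final stop codon adds no residue.
codons = gene_length_bp // 3
residues = codons - 1

print(span_bp, codons, residues)  # 1437 479 478
```

Under these assumptions the span equals the stated gene length, and the codon count minus the stop codon equals the stated protein length.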


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.355326
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAGAG CCATCGCTGC GCTTCTGGCA GCCGCCCTCC CCTTCGCCCT CCCGCTGTCT 
GGCGCATCGG CCGACGTCGG CAGTTCGATG GACTCGTTTC TGAACGACGT CGGGGGCGCT
GCCAACGTCA ACGGACCGAC CGCGTTCCAG GGGCAGTCGG CCGGCTATTA CAGCCTCGGC
AATGTCTGGA CGCGGTTCCC GCAGAAGACG ACCAACATCG CCAATCTGCA GTTGCCGCGC
GCTCGCGCCG GTTGCGGCGG CATCGACATC TTTGCGGGGT CCTTCAGCTT CATCAACGCG
AGCGAGATCG TCGCGATGCT GAAGGCAGTC GCGAACAATG CTGTCGGCTT TGCCTTCAGC
CTAGCGATCG ACACGGTCTG CCCGGAATGC TCGAAGATCA TGCAGGAGTT CAGCCAGAAG
GCTCAGCTCA TGAACAACCT CAACATCAAC TCCTGCGAGA TGGCCCAGGG TCTGGTGGGC
GGGATCTGGC CCAAGGGCGA CCTTGCCGAC AAGGCGATCT GCGAAGCGAT CGGCAACTCC
GAAGGGATTT TCACCGACTA TGCCGCGGCC AAGCATGGGT GCGGCACCAA GGGGCAGCGA
TCGAGCACGA CCGCGCAAGG CTCAGGCAAA TATGATGACG TCAATCCCGG GGTGCCGCGC
AACTACACCT GGACGATCCT CAAGAAGTCG GCGTTCTTCT CGCCAGGCGG CCGGTTCGAC
GAAGAGCTCG CCGAATATGC GATGACGCTG CTCGGCACGA TAATATACGT GCCCCCAAAG
GACGATGAGC TGGGCAAGTT CGTGCCAATC GTCGGCGAAG CCTCGTCCAC CCTCGTGACT
TCGCTGCTGG ATGGCACGGC GAATGGCAAT GTCCTCATTT TCGACTGCGA CGAGCCGGAA
AAGTGCCTCA ACCCGGGCTT CAAGTCGCTG AGCCTGCCGG CATCGAAAGC GCTGCGGCCG
CGCGTGGCTG CGCTCATCGG CGGTATGGTT CAGGCCATCC GCGACGACAC CGCGATCAGC
GAAGAGCAAA AGGAACTGCT GCAAGTCGCG TCTATCCCGC TCTACAAGAT CCTGACCGTC
CAGGCGGCCT ATGGCCGGGG CATGCCGACC GACGACCGGG AGACCCTGGC CGAGATCGCC
AGTGTCGACC TGCTGTTTGC TGTGCTCGAC CGGATAGTGA GCGAGGCGGG CCGCTCGATG
TCGAGCTTCA TCGGGGCCGA CGAAGCCAAG ATCGCCATGT GGCAGAATCA GGTCAATGTC
GTGCGTCAGG CGCTCGCTGA CCGGCAGGCC AACACGCATC TCAAGGTCAA TGCGGTGCTG
CAGATCATCG AGAAGACGGC GTTCATCGAG AACGTGCTGG CCGCCTCGAT GTCGCCCGGA
ATGGCCGCAT CGCTGGACTG GTCGCGCGGC GTCCAGAGCC GCGCCCTTAC CCACTGA
 
Protein sequence
MKRAIAALLA AALPFALPLS GASADVGSSM DSFLNDVGGA ANVNGPTAFQ GQSAGYYSLG 
NVWTRFPQKT TNIANLQLPR ARAGCGGIDI FAGSFSFINA SEIVAMLKAV ANNAVGFAFS
LAIDTVCPEC SKIMQEFSQK AQLMNNLNIN SCEMAQGLVG GIWPKGDLAD KAICEAIGNS
EGIFTDYAAA KHGCGTKGQR SSTTAQGSGK YDDVNPGVPR NYTWTILKKS AFFSPGGRFD
EELAEYAMTL LGTIIYVPPK DDELGKFVPI VGEASSTLVT SLLDGTANGN VLIFDCDEPE
KCLNPGFKSL SLPASKALRP RVAALIGGMV QAIRDDTAIS EEQKELLQVA SIPLYKILTV
QAAYGRGMPT DDRETLAEIA SVDLLFAVLD RIVSEAGRSM SSFIGADEAK IAMWQNQVNV
VRQALADRQA NTHLKVNAVL QIIEKTAFIE NVLAASMSPG MAASLDWSRG VQSRALTH
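
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates this on the first 30 and last 12 bases of the gene sequence only. It assumes the amino acid assignments of NCBI table 11, which match the standard code for every codon (the tables differ only in permitted start codons); `translate` and `CODON_TABLE` are hypothetical helper names, not part of any database tooling.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
head = translate("ATGAAAAGAG CCATCGCTGC GCTTCTGGCA")
tail = translate("CTTACCCACTGA")
print(head, tail)  # MKRAIAALLA LTH
```

The two fragments reproduce the first ten residues and the last three residues of the protein sequence. Pasting the full gene sequence into `translate` should give the complete protein in the same way.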