Gene Saro_1862 details


Gene Information

Locus tag: Saro_1862
Symbol:
ID: 3917083
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1962769
End bp: 1963989
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 67%
IMG OID: 640444606
Product: hypothetical protein
Protein accession: YP_497136
Protein GI: 87199879
COG category: [S] Function unknown
COG ID: [COG5441] Uncharacterized conserved protein
TIGRFAM ID:
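
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. Neither convention is stated in the record itself.

```python
# Values copied from the Gene Information fields above.
start_bp = 1962769
end_bp = 1963989

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1221 0 406
```

Under these assumptions the computed values agree with the listed Gene Length (1221 bp) and Protein Length (406 aa).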


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0341328
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGACA AGCCCAGCGT CCTGTTCATC TGCACGCAGG ATACCGAGGA AGAGGAAGCC 
CGCTTCACCC GCGCCGCGCT CGAGGCGGCG GGCGTCGAAG TCGTCCACCT CGATCCCAGT
GTCCGCCGCT CGCTCGGCGG GGCGGAAATC TCGCCGGAAA TGGTCGCCCA GGCCGGCGGA
ATGACCATCG AGGAAGTCCG CGCCCTCGGC CACGAAGGCA AGTGCCAGGA CGCGATGATC
CGTGGTGCCA TCGCCGCCGC GCACGAATGG GACGCCAGAC ACCCCGTCTC CGGCATTCTC
GCGGTCGGCG GCTCGATGGG CTCGGCGCTT GCCGGTGCGC TCATGCAGAG CTTCCCCTAT
GGCCTGCCCA AGCTGATCGT CTCGACCATG GCCTCGGGCT TCACCAAGCC CTACATGGGC
GTGAAGGACA TCGCGATGAT GAACGCGGTG ACCGATATCT CGGGCATCAA CACGATCAGC
CGCGACGTCT TCCGCAACGC TGCCAACGCC GTTGCCGGAA TGGCGAAGGG CTACGACCGC
GACAAGGGCC CCGAAAAGCC TCTCGTCCTC ATCACCACGC TCGGCACGAC GGAAACCAGC
GTGAAACGCA TCCGCCAGGC ACTGGAAAGC GATGGCTGCG AAGTCATGGT CTTCCATTCC
TCCGGCGCGG GCGGCCCCAC GCTCGACGGG CTCGCCGCCG ACAAGGACGT GGCGCTGGTC
CTGGACCTTT CCCCGACCGA GATCCTCGAC CACCTCTTCG GCGGCCTGGC TGATGCCGGT
CCGGATCGCG GGCGCGCGGC CCTGCGCAAG GGCATCCCGA CGATCCTTGC CCCCGGCAAT
GCCGATTTCA TCATCGGCGG TCCGATCGAC GCCGCGGAAG CGCAGTTTCC AGGCCGGCGC
TACCACCAGC ACAACCCGCA GCTCACCGCA GTCCGCACCA ACGTCGCGGA CCTTCGGAAG
CTGGCCGATC ACCTTGCCGC CAACGTGCGC GAGGCCAAGG GCCCGGTCCG GGTCTTCACC
CCGCTCAAGG GCTTTTCCAG CCACGACAGC GAAACGGGCC ACCTGCTCGA CCTCTCGGTG
CCGGGACCCT TCGCCGAATA TCTCGCCAGC GTCATGCCAG GTCACGTGCC GGTGACCGCC
GTGGACGCCC ATTTCAACGA CGAAGCCTTC TCCAGCGCGG TCATTGCCGC CGCGCGCGAG
ATGCTTGCCG CAAAGAACTG A
 
Protein sequence
MTDKPSVLFI CTQDTEEEEA RFTRAALEAA GVEVVHLDPS VRRSLGGAEI SPEMVAQAGG 
MTIEEVRALG HEGKCQDAMI RGAIAAAHEW DARHPVSGIL AVGGSMGSAL AGALMQSFPY
GLPKLIVSTM ASGFTKPYMG VKDIAMMNAV TDISGINTIS RDVFRNAANA VAGMAKGYDR
DKGPEKPLVL ITTLGTTETS VKRIRQALES DGCEVMVFHS SGAGGPTLDG LAADKDVALV
LDLSPTEILD HLFGGLADAG PDRGRAALRK GIPTILAPGN ADFIIGGPID AAEAQFPGRR
YHQHNPQLTA VRTNVADLRK LADHLAANVR EAKGPVRVFT PLKGFSSHDS ETGHLLDLSV
PGPFAEYLAS VMPGHVPVTA VDAHFNDEAF SSAVIAAARE MLAAKN
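
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted gene sequence might be cleaned, translated, and checked for GC content. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical and not part of any database tooling. The codon table is the standard genetic code, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in the set of permitted start codons, which this sketch does not model).

```python
# Standard genetic code in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a non-empty DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGACCGACA AGCCCAGCGT CCTGTTCATC")
print(translate(first_blocks))  # MTDKPSVLFI
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence through the same helpers would be one way to compare against the listed protein and GC content fields.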