Gene Saro_0346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0346 
Symbol 
ID3918230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp372308 
End bp373762 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content62% 
IMG OID640443075 
Producthypothetical protein 
Protein accessionYP_495628 
Protein GI87198371 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAAC CTGAAGACGC ATCCGATCCG CAGACGAACC AGAACGAACC GCAGTCGGAG 
CGAGACGTCG ACGAAAGCCT GCCCGCTCGG GACATCCTCA AGATAGGTTC TATGGTGAAA
GCGCTGGGTG CCCAGCAGAG CAAGTTCGAT GCGGTCACGA GGTTGGCCCA GGTAGGCGCC
CATGCCCAGC TGAGCACCAG CGCCCAGATG TTGGCCCTGT CAGACAAGGT TTCTGCGCTC
ACCAACATCG GCGCGACGAG CAGGCTTGCC CGGATGCTAG AAGAGACAAA CCGGCATTCA
CGCCATCTGC AGGATATGAC CAGCACCGCG CGCCTGCTCG CGCAGAGCGG TAAACTTGCG
CAATACTCAA AACTGGCGGC CCTCGCGAGC GGCTATCCAG GCAGCACGCT GCCGAACTGG
GCGATCCTTA ACGACCATTC GAATATCGAT CAGTTGATGC GCTCGCAGCG CTGGGCCGAT
CATTTCTCGG CCGACTCCTT TGGCTATGCA ACGGGCCGCG CCGGCAGGTT CGCAACCGGC
GGGCTTGTTA GCGAGCAGCT TCGGCTCGTG GCGGAACGCG CAGCCGGCAT CAGGGTCGCC
GCTGGGATAG GTGCGCTGCA GGTTTCGCCA GGGGTCAAGG CGATGCTGGC CCGCAATGTC
CGTATCGGCG AAACACTCTC CAAACTTTCC GCTTTCGCTG GCGCCATCGA TACGATCGGC
GCGTTCAGTC CGGCGAGCAG CGCAGCCGTG GATTCGCTGC TTGGCGAATG GCAGACCCGA
CCTGACCTAC CCGAAGAATT CTGGCACCGC CGTTCGGTCC GTCAGCGTTA CTATGAGGAA
GCAGAGGTCG ACCGCGGCCT CATCGACACC CGCAACACCG AGATCGTCGA GGTTCTCGTT
GAGAGCGGAC TTGTCGAGGG CTACGTTCGC GGAAAGCGCG TTACCGCCGT GCTTGAGGCC
GGCCCGCTGA GCGTCGAGGT GTCGGCCTCG CGCACGCGCA TCGGAACCTA TCGGGCGATC
ACCGGCTTTG AAATCGCGAT GCGCGGCCTC GTCAGGCGAG TGCTCGTGCA AGCCCAGATC
GACGCTGGCG AAAACCCGGA CGCGTGGTTC AAACAGCGCG TTCCCGGCGA TATTCTACAG
CGAGCAAAAG AGCGCAAGGC AGATGCCGAA CGCGTGGGCG AACAGGCAGC CGATCTCATA
GCGTTTGTCG ACTTGGGCGA TCTGATCCCA ATCGTGACGC TTAAAAAGAA CTGGCCGGTA
TTCCAGCCGA TCTTCGGCAA CGCGGAGGAT TTCCGGGTGG ACATGCGCCG CCTGAACGCG
ATCCGCAGGC CCGCAATGCA CTCCCGTTCG ATCGATCCTG TCCAGTTCAC CGAGATGGTC
ATCGTCATCG ACCGCATCAG CTCCATGATA CGGGGCAGCT TTGGCTGGAT GGCCGAGTGG
GACGAAGAGG GCTGA
 
Protein sequence
MSEPEDASDP QTNQNEPQSE RDVDESLPAR DILKIGSMVK ALGAQQSKFD AVTRLAQVGA 
HAQLSTSAQM LALSDKVSAL TNIGATSRLA RMLEETNRHS RHLQDMTSTA RLLAQSGKLA
QYSKLAALAS GYPGSTLPNW AILNDHSNID QLMRSQRWAD HFSADSFGYA TGRAGRFATG
GLVSEQLRLV AERAAGIRVA AGIGALQVSP GVKAMLARNV RIGETLSKLS AFAGAIDTIG
AFSPASSAAV DSLLGEWQTR PDLPEEFWHR RSVRQRYYEE AEVDRGLIDT RNTEIVEVLV
ESGLVEGYVR GKRVTAVLEA GPLSVEVSAS RTRIGTYRAI TGFEIAMRGL VRRVLVQAQI
DAGENPDAWF KQRVPGDILQ RAKERKADAE RVGEQAADLI AFVDLGDLIP IVTLKKNWPV
FQPIFGNAED FRVDMRRLNA IRRPAMHSRS IDPVQFTEMV IVIDRISSMI RGSFGWMAEW
DEEG