Gene Saro_3943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3943 
Symbol 
ID5077427 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009426 
Strand
Start bp114946 
End bp116901 
Gene Length1956 bp 
Protein Length651 aa 
Translation table11 
GC content58% 
IMG OID640481049 
Producthypothetical protein 
Protein accessionYP_001165711 
Protein GI146275550 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATCAAGC GCTTACGTCG TGCAGCGAGA CAGGGAACCC ACTGGTTGAC TGCCGGGCGT 
TCTCGCGAGC ATCAAATCAA CTCTGCGGCA CCGTTGAAAA TCCCGTTCGA GAAGGACACT
GTCGAAGGAT ACATAGCGAG TCTGGATGCG CTGATCCGTG CGCCCGGAAC CCACGTGTAT
GTCGACACCT CGTTTCTGGT TTGGCTAACT GCCCTCGGGT GCGAGGCCCG CAACGAATTC
ACAGGATGGC TGCGGGCAGT AGCGGCAGGT CGGGTGCATG TCCCTGTGTG GGCGGCGCAC
GAATACCTGC GACACCACAT GGGGAATCTG CACGGCAAGA AGCTGAAGGG CATCGAAACC
GCGTTGAACG ATCTCGCCAA CAATACCTTC AACGACCTTC GGCCGTACAT CGATACCTCA
TTCACAGGCG ACAGCCGGTC GCCAGCAGAA ATCATTGCCG CGACGAGGTC AGTGCTTATT
GACGTCAAAC GCGTCGCCGC CATCGCCGGG CGGTGGACGA AGCAGCACTA TGACAGCAAT
TCTAAAGCAA TTATCGAATT CATCAATGAG TGCGGCCTTC CAAGCGCACC GATGCTCGAC
TGGATGGGCG ACATCCAATC CGTAGAGGAG GCCCGTTTCG AAGGCCGCAT ACCCCCAGGC
TTCCAGGACC GGAACAAGTC TGGTGCGAAC GGAGGGGGAG CCAACAGCTT CGGCGATCTG
ATGTTCTGGA AAGAGATCCT TCATCACGCT GGTCAACGGC GCGCTCGAGG CGTCGTGGTC
ATCAGCAACG ACGGCAAGAA CGATTGGGTC ATGGGAGGGC TAGATCAACC GGACCTGGAT
GCGGAACTTA AGGTGATCGC GAGCAAGCTA CCCCCGATCC CAAGGCCGCA ACCGATGCTT
GAATATGAGG CCAAAGCGTC CGCCGGTGTT CAAGAACTGA TGCTGGTAGA TCGGAAATAT
CTCGCGATCT ACTTGCGCCG CACCGGCGTC CCAAGTGACA GGTTCTTTGG TTCGGCTATC
GACGTCACGC TTCCCTCCCC TGACCGAGAA GACGAGGCCA TTCGGAAGCA AGCGCGTGAC
CAAGCGACCG GCAGAACTTC CGTCGCGAGC GGAATAGAGC CACAGAGAGA CACGCCAAAG
CACCTGCCAG TGGATGACGC TTCTGGCATT GCTGACAATC CTCTCGCTCT TCGGCTTGCG
TTCAGTGCCA GCAGCTCGGA CGCAAACGAG AAATCCGGCC CTCTCCTCGA TCACATGCTT
GCCAATGATG CGGAAGGACT AGGCCTAGAT GCATTCCTGA CAAAGGAAGC GTTGGCTAAC
TGGGACGGCC GGGCGGCTGT TTGGTTCGGT CGATCGCTGG GCACCAGATC CATCGAGGGC
AATGCCCTAG CTACCACCTA CACGACGGAC TTGCTCGGCG TGTTCGAGCG GCTGCCGCCG
AGGACAGCAA CTAACCTTTA CCTTGGCCTC CTGGCGTCGG CCTACGTTGA TGGGTCATCG
CTCAAGACAA TTCCCCGCAC ACCTTGGCTG CCTCGGCTCT TAGCGCTTCA AGGGCAACCG
CGTGCCAAGG GGGCCATCGA CGCGTTCCGG AACATTGTCG CTGATTGGCC CGGTCGCCCA
GTCTATCTGC CAGACGCGGA TCGACCTGCA CTGTCGGTCA AACCACTACT TGCGAAGGCT
ACCGGCACCG CGCCCCGCCT GACAGGTCTA CAGATTGGGG GGATTGGCGT AATCGTGGAA
GCTCAGGAAG ATGCAGGACT TCGCTTGGCC AACCGGTTTC CAGGAGTAAC GACCGTCGTT
CTCGGCGATG TCGTCAAAGA TGTGTGCAAC GCTCTGGGCA TACCGTTTGA CCAATTGATG
GCGCATGAGG CCTTCGAGCG AGAGGTCGCT TTCGGAAGTA CCGTCGGGAT TGCTGCCGAA
GGGGATCTTA GAAATAGTAT GGAAGACCAA TCATGA
 
Protein sequence
MIKRLRRAAR QGTHWLTAGR SREHQINSAA PLKIPFEKDT VEGYIASLDA LIRAPGTHVY 
VDTSFLVWLT ALGCEARNEF TGWLRAVAAG RVHVPVWAAH EYLRHHMGNL HGKKLKGIET
ALNDLANNTF NDLRPYIDTS FTGDSRSPAE IIAATRSVLI DVKRVAAIAG RWTKQHYDSN
SKAIIEFINE CGLPSAPMLD WMGDIQSVEE ARFEGRIPPG FQDRNKSGAN GGGANSFGDL
MFWKEILHHA GQRRARGVVV ISNDGKNDWV MGGLDQPDLD AELKVIASKL PPIPRPQPML
EYEAKASAGV QELMLVDRKY LAIYLRRTGV PSDRFFGSAI DVTLPSPDRE DEAIRKQARD
QATGRTSVAS GIEPQRDTPK HLPVDDASGI ADNPLALRLA FSASSSDANE KSGPLLDHML
ANDAEGLGLD AFLTKEALAN WDGRAAVWFG RSLGTRSIEG NALATTYTTD LLGVFERLPP
RTATNLYLGL LASAYVDGSS LKTIPRTPWL PRLLALQGQP RAKGAIDAFR NIVADWPGRP
VYLPDADRPA LSVKPLLAKA TGTAPRLTGL QIGGIGVIVE AQEDAGLRLA NRFPGVTTVV
LGDVVKDVCN ALGIPFDQLM AHEAFEREVA FGSTVGIAAE GDLRNSMEDQ S