Gene Saro_3003 details


Gene Information

Locus tag: Saro_3003
Symbol:
ID: 3917439
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 3219642
End bp: 3221714
Gene Length: 2073 bp
Protein Length: 690 aa
Translation table: 11
GC content: 61%
IMG OID: 640445782
Product: hypothetical protein
Protein accession: YP_498272
Protein GI: 87201015
COG category:
COG ID:
TIGRFAM ID:
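
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive (the convention under which the listed gene length comes out exactly) and that the protein length excludes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3219642, 3221714)
print(length_bp, protein_length(length_bp))  # 2073 690
```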


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCGACG AACTCTACGA ACCCGATGAT GCTCTTGAAG AGCAGACCCG CGACGAGGAA 
AAGCTGCGCG AGGTCCATGC CCGCGCGCTT GCCCGGTTCG ATGCCATCGC CTCCGCGACG
CAGGAATGCC GCGCCAAGAG CCTGGAAGCC CGCCGCTTCA TCACGATCCC CGGTGCGCAG
TGGGAAGGCG AGTGGGGCGA GCAGTTCGAT AACTCGATCA AGCTCGAAGT TGACAAGGTT
GGTCGCGGCG TCGCCAAGAT CGAAACCGAC TACCGCGAAA ACCGCATCAT CCCGGACTTC
CGCCCCGATG GCCCGAATGC CGATCAGGAT ACGGCGGATA TGCTCGATGG TCTGCACCGT
GCCGACAGCT ATCGGTTCAA GTCGCAGCAG GCCCGCGACA ATGCGTTCTT CGAGGCCGTT
GCCGGTGGCT TCGGTGCCTA TCGCCTGACC AATGAATGGG AAGACGAGAG CGACAAGGAC
AACGACCACC AGCGCGTCAA CCCGGCATCG ATCATTGTTG ACGCTGACCA GTCGGTGTTC
TTCGATCTAC AGGCGCGCAT GTATGACAAG TCCGATGCGC GCTTTGCCTT AGTCCGGACC
AAGCTGACCC GCGAAGCGTT CGAGGATGAG TATGACGGCT GCTATTCCGA ATGGCCCGAG
GCTCCGCGCT GGAAGTTCAC GGACTGGTTT GCGCCGGATA CCGTGGCCAT CGCGGAATAC
TACGAGCGCG AGGAAGTATC GGACACGCTC CATATCCTGA CCAACAAGCT CTCTGGCGAG
GAATTGCGCC TGTGGGCTTC GGACATGGAA AAGGGCGTTC TGGCGCAATA CAAGGCCGAT
GGCTGGGCGG TCAAAAGCCA GAAGCGGAAG CGCTGCCGGG TCCACAAGTA TGTACTGTCC
GGTGCCGAGG TTCTGGAGGA CTGCGGTTAT ATCGCGGGCA CCGAACTCCC CATCGTTCCG
GTCTACGGCA AGCGCTATTT CGTTGACGGC ATCGAACGGT GGAACGGTTA CGTCCAGCCC
AAGATGGACA GCCAGCGGCT TTACAATTCC AACGTGTCGA AGCTGGCGGA AACCAATGCG
CTTTCGCCGC GTGAGGTGCC GATCTTCGAT CCGACGCAGA TCGATGCCGT GCAGGAAGGC
CAGTGGGCGC GAGCGAATAT TGACCGCCTG CCGTACCTGA CTGCCCATGC GCTGCGGAAC
CCCGACGGTT CGGTTGCTAT GGCTGGGCCG ATTGGCAAGG TGGAGCCGCC GACGCTCGCA
CCGGTCACGG CGACCCTGTT GCAGATCGCC AACCAGGACT TGCAGGAAGA GCTTAACGAC
GGCGCGGACG AGGTAAAGGC CAACACCTCT GCCGAGGCGA TGGACATTGC AGCCGCGCGC
GTTGATGCGA AGTCGGGCAT CTATCTCGAC AACATGCGCC AGTCCGTGCA GCGCGAGGGC
GAGATCTACA TCTCCATGGC GTCCGAGGTC TATTCCGAGG AAGGCCGCGA AGTCCGCACC
ATGACTGAGG ATGGTGACGA CGGCACGGCC ATCCTCAAGC AGATGAAGAC CGATCCCAAG
ACCGGCGAGA ATGCCACGAT CAACGATCTG GAGCATGGGC GCTACAAGGT GATTGCATCG
GTCACGGAAG CAACTGCGAC CCGCCGTGAC AAGACCGTCA AGGCGATGCT TCGCGTTGCC
GAGGTGGCCA CTGCTGCGCA GGACATGGAA ATGGCGCAGG CTGCCATCGT TACCGCCGTG
ATGAATACGG ACGGCGAAGG CACCGATGGC TTCATGCAGT GGATGCGCAA GGTCAAGGCG
CTCCCGATGG GCCTTGTCGA GCCGAACGAC GAAGAAAAGG CGGAAATGGA ACAGGCAGCG
CAGAACGTGC AGCCCGATCC CATGGCAAAC CTTGCCAACG CACAGGCCAG GCAGTTCGAG
GCGGATGCAG CCAAGAAGGC GGCGGAAGTT GCCGAGACGG AGGCGAACAC CCGCTTGCTC
GACGCAAAGA CCGTGGAGAC GCTGGAGAAG GCGCAGCAGC CTGCGAACGA TCAGCCATCC
ATCCCGCTCA ATCGCGGACC ATACGCGGCG TAA
 
Protein sequence
MADELYEPDD ALEEQTRDEE KLREVHARAL ARFDAIASAT QECRAKSLEA RRFITIPGAQ 
WEGEWGEQFD NSIKLEVDKV GRGVAKIETD YRENRIIPDF RPDGPNADQD TADMLDGLHR
ADSYRFKSQQ ARDNAFFEAV AGGFGAYRLT NEWEDESDKD NDHQRVNPAS IIVDADQSVF
FDLQARMYDK SDARFALVRT KLTREAFEDE YDGCYSEWPE APRWKFTDWF APDTVAIAEY
YEREEVSDTL HILTNKLSGE ELRLWASDME KGVLAQYKAD GWAVKSQKRK RCRVHKYVLS
GAEVLEDCGY IAGTELPIVP VYGKRYFVDG IERWNGYVQP KMDSQRLYNS NVSKLAETNA
LSPREVPIFD PTQIDAVQEG QWARANIDRL PYLTAHALRN PDGSVAMAGP IGKVEPPTLA
PVTATLLQIA NQDLQEELND GADEVKANTS AEAMDIAAAR VDAKSGIYLD NMRQSVQREG
EIYISMASEV YSEEGREVRT MTEDGDDGTA ILKQMKTDPK TGENATINDL EHGRYKVIAS
VTEATATRRD KTVKAMLRVA EVATAAQDME MAQAAIVTAV MNTDGEGTDG FMQWMRKVKA
LPMGLVEPND EEKAEMEQAA QNVQPDPMAN LANAQARQFE ADAAKKAAEV AETEANTRLL
DAKTVETLEK AQQPANDQPS IPLNRGPYAA
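
As a sanity check, the gene sequence can be translated and compared with the protein sequence, and its GC fraction compared with the listed GC content. The sketch below is a minimal standard-library version under a few assumptions: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the spaces in the listings are treated as display grouping only, and the amino-acid assignments of translation table 11 are taken to be the same as those of the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the whitespace used for display grouping and upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGGCCGACG AACTCTACGA ACCCGATGAT GCTCTTGAAG AGCAGACCCG CGACGAGGAA"
print(translate(first_line))  # MADELYEPDDALEEQTRDEE
```

Passing the full gene sequence to `translate` gives a string that can be compared with the protein sequence once its spaces are removed, and `round(100 * gc_fraction(...))` can be compared with the listed GC content.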