Gene Saro_1103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1103 
Symbol 
ID3916399 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1146414 
End bp1150610 
Gene Length4197 bp 
Protein Length1398 aa 
Translation table11 
GC content68% 
IMG OID640443838 
Producthypothetical protein 
Protein accessionYP_496382 
Protein GI87199125 
COG category[S] Function unknown 
COG ID[COG2911] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGAGG ACGCCGTCCT TCCCGACGAA CCCCGGCGGC CTGAACGTTC GCGCCGGTCG 
CGGATCGTGC GCGGCATCGC GCGGCGAGGC GCGATCCTGC TGGTGGCCAC GATCGCGATG
ATTCTCGCGG CGCTCGTTGT TCTCGACAGT TCGCTTGGCC ACAGGCTGGT TGCGGACCGG
ATCGCGGCAC TCGCCCCCGG ATCGGGCCTG AGGATCGAGA TCGGCCGCAT CGACGGCTCG
ATCTATGGCG CGGCAAAGTT GCGCGATATC CGCGTGAGCG ATCCGGAAGG CGTGTTCCTG
ACCGTGCCCG AGGCGGAACT GGACTGGCGC CCGCTGTCCT GGCTGAAGAC GGGCCTCGAC
GTGCGCCTGT TGGCGTTGCA TCGCGGGACG CTGCGCCGCG CGCCGCGCCT GCGGCCCAGC
GAGGACACGA ACCAGCCGAT CCTGCCCGAC TTCGACATTC GCGTGGACAA GCTGGTGGTC
GACAACCTGA CCGTTTCCGA AGCCCTTGCC GGGCGCAAGC GGCGGGTCGA TGTGGTGGCG
AAGGCCGACA TCCGGGGCGG CCACGCCATC GTCAACGTCA ACGGCGTGCT TGGCGGCAAG
GACCGCGTCG TATTCGTCCT CGACAGTGAA CCGGACCGTG ACAAGTTCGA CCTGAGGCTG
GCCTACGACG CGCCCAGTGA TGGCGTGATC GCCGCGATGA TGGGCGCGAA GAAGGACGTG
CGCGCCCGTG TGTTCGGCAG GGGCGGCTGG TCAAGCTGGA ATGGTATCGC CTATGCGACG
CAGGACGGCC AGCGGCTGGC CGCGTTCCAG CTCGAAAAGC ACAAGGGTGC CTATCGACTC
GCAGGGCAGG CATGGCCTGG CGATCTGTTG AAGGGCACGT CGGGCAGGGC CGTTGGACCC
GCGCTGTCGC TGCTGTTCGA CGGTACCTTT GCCGACCGGG TGCTGGACGG CAGGCTGCGA
GCGGCGGGCG CGGCGTTCAA GCTGGCGACC GACGGCGGGC TCGACCTTGG CAGCAATGCC
GCGAACGACC TCAAGGTGAA GGCGCGCATA TTGCGACCGG AACTGCTGCT GGCATCGCCG
CAGCTTTCCG GAGTGGCGCT CGATGCCACA CTGGACGGTG CGCTGAAAGA GCTGTCCATC
GAGCATGTGG TGACCGTGGA GCGAATGAAG CTCGGCACGC TTGACGCGCA GGGGCTGCGC
ACGGCTGGCA CCGCGACATG GGATGGAGCA CGCTTCACCC TGCCGCTGGC GGTGACCGCG
CGGCGTGTGG TGACGGGTAA CGCGCTAGTG GACCCGCGCT TTGCCGGCGG GCGGCTGACC
GGCGATCTGG TGTTTGCCGG AAATCGGCTG ACGTCGGAAA ACCTGTCGCT CGCCCTCAAC
GGGCTTGGCG CCCGGCTGGT GCTGCGCGGG GATATCGCGC GGGGAGGCTA TGCGCTGGCT
GGGCCTGTCG CGGCCCGCGG TCTTGCGGTG CCGAACCTGG GCACGGTGGA TGGCAATGCC
AAGATCGTGT TCAAGATCGG CAGCGGTGTG CCATGGACGC TCCAGGCCAA TGTTGCCGGA
CGCATGACGC GCATCAGCAA CGGGACCTTG CAGACGCTGA CGGGAGGCGG ATTGCGCTTT
GCGGGTGGCG CTGCCCTGGG CGAGCGTATT CCGGTGACGT TCCGCAAGAC GACGATCAAT
TCGAACAAGT TGCAGATGAC GCTCGATGGC AAGGTGCTGC CGGGCGGCGC GGCATCGCTG
ACAGGCAGCG GCCGCCACAC CGATTACGGC GCCTTCACTG TCGAAGCGGC GATGACCGGG
AGCGGGCCGA ACGCGGTGCT GGTCTTTGCA AGCCCTTTGC CAGCGGCCGG CCTCAAGGAC
GTGCGGGTGG CGCTTTCGCC CATTGCCGAA GGTTTCCGCA TCGAGACCGA TGGGCAGTCG
ACGTTCGGGC CGTTCAATGG GGCGCTCGGG CTGTTCATGC CGCAAGGCGG CGCGACGCGG
ATCGACATTG AGCGTTTCCG CGTGTGGCAG ACCGATGTCA CCGGTGGTCT CACGCTCGGC
AACGAGGGGG TCGCCGGACA GCTTGCGCTG GTGGGCGGCG GCGTCAACGG AACGGTCGCG
CTTGTGCCCC GAGACGGCGG CCAGGGCTTT GATGCCAACC TGACCGCGCG CAATGCTCGG
TTCGGCGGGA CGCGGCCGCT GTCGATAGGC AACGCCAAGG TCGACGCCAC CGGTCTCATC
AAGGACGGGC ACTCGACAAT CGAGGGCAAT GTGCTGGCCG AAGGCATCGG CATGGGCAAG
ATCTTCATCG GTCGGCTGGC GGCGGCGGCC TATGTCCAGG ATGGCAGCGG ATCGGTCACG
GCCTCGGTGT CCGGCCGGCG TGGGACGCGG TTTGCCTTGC AGGGCACAGC GGCGTTTGCC
CCCGACCAGA TCGTGACATT CGTCTCGGGC GAGTATGCGG GGCGGAGCGT CACCATGCCC
CGGCGCGCGG TGTTGACGCG GGAAGGGCCG GGCTGGCGCC TGGCGCCCAC GCAGATCGGT
TTCGGGCGCG GGATACTGAT TGCGGAAGGC CACATTCTTG GCGGTCCTAC GCAATTGCGC
CTGCTGATGT CGAAGATGCC GCTTTCGGCC GTCGACATCG TGGTGGCCGA TCTCGGACTG
GGCGGCATCG CTTCGGGCAT CGTCGAATAC AACAACGATG GAAAGGGCGC GCCTTCGGGC
AATGCTGCGC TCATCGTGAA GGGCCTTTCG CGCTCGGGCC TGGTCCTGAC CTCGCGGCCG
GTGGACCTCG CGCTGGTAGC GCGGCTCGAT CCCGATGCCC TGCAGACGCG GGCGGTGATC
CGCGAGGGCA ACGAGGTGCG CGGGCGGTTC CAGGCGCGTA TCGGAGGCCT TCCGCGCGGC
GGGGGCTTTG TGGACAGGCT TCAGGCAGGG CAACTGGCTG GGCAGCTTCG CTATTCGGGG
CCGGCAGACG CACTGTGGCG CCTGACCGGT GTCGAGGTGT TCGACTTGAC CGGTCCCTTA
GGCGCGCGGG CTGACATCTC GGGGAGCATT GGCGCTCCCG TCCTGCGAGG CGCGGTGGCA
TCCAAGGGGA TGCGTGTGCA GAGCACGCTT ACCGGTACCG ACTTGCGGCA GGTGGAACTG
GCGGGGACGT TCACCGACTC CACTTTGCAA CTGGCGCGCT TCAGCGGCGT CACGCCCAAT
GGCGGCCGGG TGAGCGGCAG CGGCACGATC GGCCTTGCCG ACCTTGACCA GCATGGGCCT
TCGATTGACC TCAAGCTTTC GGCGCAGAAC GCGCAGCTTA TCAATCGCGA CGACATGGCG
GCAGCCGTCA CCGGGCCGTT GCGCATCGTC AGTTCAGGCG TTGGCGGCAC CATCGCGGGA
CGTGTGCGGA TCGAGCGTGC GCGCTGGGCG CTGGGCCGGG CAACCGCGGC GCGGGAACTG
CCGAACATCG CCACGCGCGA GATCAATGCG CCAGCCGATG CCGCCCCGGC CCGCACGCCG
GCAGCGCCGT GGCGCTTCTT GATCGATGCG AGCGGCGCGA ACCTGATCAA CGTGCGGGGA
CTGGGCCTCG ACAGCGAGTG GGGTGCCGAC ATCCGTCTGC GCGGCACGAC CGCTGCGCCC
CAGATCTTCG GGACGGCGGA CCTGGTGCGC GGCGGCTACG AGTTCGCGGG CAAGCGCTTC
GAACTGACGC GCGGCCGGAT CCGCTTCACC GGCGAAGTGC CGGTCGATCC GCTGCTCGAC
ATCGTGGCTG AGGGCGATGC GAACAACATC AGCGCCAAGA TCACGATCAC CGGCACTGGC
AACCGGCCGA TCATTGCGTT CTCCTCGACC CCGTCGCTGC CGGAAGAGGA ATTGCTGAGC
CGCATCCTGT TCGGCAGCTC GATCACCCAG ATTTCCGCCC CCGAGGCGGT GCAGCTTGCA
TCGGCGCTCG CTTCGCTGCG CGGGGGTGGC GGGTTGGACC CGATCAACAA GCTGCGCGCG
GCCATCGGGC TCGACCGCCT GCGCATCGTC GGTGCCGATC CGACTGTGGG TTCGGGTACG
AGCATCGCGG TGGGCAAGTA CATAGGCCGT CGCTTCTTCG TCGAACTCGT GACCGATGGC
GGAGGCTACA GCGCGACCTC GGTGGAATTC CGCATCACGC GCTGGCTTGC GCTGCTCGCC
ACGATGTCGA CCATCGGGGA CGAGAGCATC AACCTCAAGG CGAGCAAGGA CTACTGA
 
Protein sequence
MAEDAVLPDE PRRPERSRRS RIVRGIARRG AILLVATIAM ILAALVVLDS SLGHRLVADR 
IAALAPGSGL RIEIGRIDGS IYGAAKLRDI RVSDPEGVFL TVPEAELDWR PLSWLKTGLD
VRLLALHRGT LRRAPRLRPS EDTNQPILPD FDIRVDKLVV DNLTVSEALA GRKRRVDVVA
KADIRGGHAI VNVNGVLGGK DRVVFVLDSE PDRDKFDLRL AYDAPSDGVI AAMMGAKKDV
RARVFGRGGW SSWNGIAYAT QDGQRLAAFQ LEKHKGAYRL AGQAWPGDLL KGTSGRAVGP
ALSLLFDGTF ADRVLDGRLR AAGAAFKLAT DGGLDLGSNA ANDLKVKARI LRPELLLASP
QLSGVALDAT LDGALKELSI EHVVTVERMK LGTLDAQGLR TAGTATWDGA RFTLPLAVTA
RRVVTGNALV DPRFAGGRLT GDLVFAGNRL TSENLSLALN GLGARLVLRG DIARGGYALA
GPVAARGLAV PNLGTVDGNA KIVFKIGSGV PWTLQANVAG RMTRISNGTL QTLTGGGLRF
AGGAALGERI PVTFRKTTIN SNKLQMTLDG KVLPGGAASL TGSGRHTDYG AFTVEAAMTG
SGPNAVLVFA SPLPAAGLKD VRVALSPIAE GFRIETDGQS TFGPFNGALG LFMPQGGATR
IDIERFRVWQ TDVTGGLTLG NEGVAGQLAL VGGGVNGTVA LVPRDGGQGF DANLTARNAR
FGGTRPLSIG NAKVDATGLI KDGHSTIEGN VLAEGIGMGK IFIGRLAAAA YVQDGSGSVT
ASVSGRRGTR FALQGTAAFA PDQIVTFVSG EYAGRSVTMP RRAVLTREGP GWRLAPTQIG
FGRGILIAEG HILGGPTQLR LLMSKMPLSA VDIVVADLGL GGIASGIVEY NNDGKGAPSG
NAALIVKGLS RSGLVLTSRP VDLALVARLD PDALQTRAVI REGNEVRGRF QARIGGLPRG
GGFVDRLQAG QLAGQLRYSG PADALWRLTG VEVFDLTGPL GARADISGSI GAPVLRGAVA
SKGMRVQSTL TGTDLRQVEL AGTFTDSTLQ LARFSGVTPN GGRVSGSGTI GLADLDQHGP
SIDLKLSAQN AQLINRDDMA AAVTGPLRIV SSGVGGTIAG RVRIERARWA LGRATAAREL
PNIATREINA PADAAPARTP AAPWRFLIDA SGANLINVRG LGLDSEWGAD IRLRGTTAAP
QIFGTADLVR GGYEFAGKRF ELTRGRIRFT GEVPVDPLLD IVAEGDANNI SAKITITGTG
NRPIIAFSST PSLPEEELLS RILFGSSITQ ISAPEAVQLA SALASLRGGG GLDPINKLRA
AIGLDRLRIV GADPTVGSGT SIAVGKYIGR RFFVELVTDG GGYSATSVEF RITRWLALLA
TMSTIGDESI NLKASKDY