Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1103 |
Symbol | |
ID | 3916399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1146414 |
End bp | 1150610 |
Gene Length | 4197 bp |
Protein Length | 1398 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640443838 |
Product | hypothetical protein |
Protein accession | YP_496382 |
Protein GI | 87199125 |
COG category | [S] Function unknown |
COG ID | [COG2911] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGAGG ACGCCGTCCT TCCCGACGAA CCCCGGCGGC CTGAACGTTC GCGCCGGTCG CGGATCGTGC GCGGCATCGC GCGGCGAGGC GCGATCCTGC TGGTGGCCAC GATCGCGATG ATTCTCGCGG CGCTCGTTGT TCTCGACAGT TCGCTTGGCC ACAGGCTGGT TGCGGACCGG ATCGCGGCAC TCGCCCCCGG ATCGGGCCTG AGGATCGAGA TCGGCCGCAT CGACGGCTCG ATCTATGGCG CGGCAAAGTT GCGCGATATC CGCGTGAGCG ATCCGGAAGG CGTGTTCCTG ACCGTGCCCG AGGCGGAACT GGACTGGCGC CCGCTGTCCT GGCTGAAGAC GGGCCTCGAC GTGCGCCTGT TGGCGTTGCA TCGCGGGACG CTGCGCCGCG CGCCGCGCCT GCGGCCCAGC GAGGACACGA ACCAGCCGAT CCTGCCCGAC TTCGACATTC GCGTGGACAA GCTGGTGGTC GACAACCTGA CCGTTTCCGA AGCCCTTGCC GGGCGCAAGC GGCGGGTCGA TGTGGTGGCG AAGGCCGACA TCCGGGGCGG CCACGCCATC GTCAACGTCA ACGGCGTGCT TGGCGGCAAG GACCGCGTCG TATTCGTCCT CGACAGTGAA CCGGACCGTG ACAAGTTCGA CCTGAGGCTG GCCTACGACG CGCCCAGTGA TGGCGTGATC GCCGCGATGA TGGGCGCGAA GAAGGACGTG CGCGCCCGTG TGTTCGGCAG GGGCGGCTGG TCAAGCTGGA ATGGTATCGC CTATGCGACG CAGGACGGCC AGCGGCTGGC CGCGTTCCAG CTCGAAAAGC ACAAGGGTGC CTATCGACTC GCAGGGCAGG CATGGCCTGG CGATCTGTTG AAGGGCACGT CGGGCAGGGC CGTTGGACCC GCGCTGTCGC TGCTGTTCGA CGGTACCTTT GCCGACCGGG TGCTGGACGG CAGGCTGCGA GCGGCGGGCG CGGCGTTCAA GCTGGCGACC GACGGCGGGC TCGACCTTGG CAGCAATGCC GCGAACGACC TCAAGGTGAA GGCGCGCATA TTGCGACCGG AACTGCTGCT GGCATCGCCG CAGCTTTCCG GAGTGGCGCT CGATGCCACA CTGGACGGTG CGCTGAAAGA GCTGTCCATC GAGCATGTGG TGACCGTGGA GCGAATGAAG CTCGGCACGC TTGACGCGCA GGGGCTGCGC ACGGCTGGCA CCGCGACATG GGATGGAGCA CGCTTCACCC TGCCGCTGGC GGTGACCGCG CGGCGTGTGG TGACGGGTAA CGCGCTAGTG GACCCGCGCT TTGCCGGCGG GCGGCTGACC GGCGATCTGG TGTTTGCCGG AAATCGGCTG ACGTCGGAAA ACCTGTCGCT CGCCCTCAAC GGGCTTGGCG CCCGGCTGGT GCTGCGCGGG GATATCGCGC GGGGAGGCTA TGCGCTGGCT GGGCCTGTCG CGGCCCGCGG TCTTGCGGTG CCGAACCTGG GCACGGTGGA TGGCAATGCC AAGATCGTGT TCAAGATCGG CAGCGGTGTG CCATGGACGC TCCAGGCCAA TGTTGCCGGA CGCATGACGC GCATCAGCAA CGGGACCTTG CAGACGCTGA CGGGAGGCGG ATTGCGCTTT GCGGGTGGCG CTGCCCTGGG CGAGCGTATT CCGGTGACGT TCCGCAAGAC GACGATCAAT TCGAACAAGT TGCAGATGAC GCTCGATGGC AAGGTGCTGC CGGGCGGCGC GGCATCGCTG ACAGGCAGCG GCCGCCACAC CGATTACGGC GCCTTCACTG TCGAAGCGGC GATGACCGGG AGCGGGCCGA ACGCGGTGCT GGTCTTTGCA AGCCCTTTGC CAGCGGCCGG CCTCAAGGAC GTGCGGGTGG CGCTTTCGCC CATTGCCGAA GGTTTCCGCA TCGAGACCGA TGGGCAGTCG ACGTTCGGGC CGTTCAATGG GGCGCTCGGG CTGTTCATGC CGCAAGGCGG CGCGACGCGG ATCGACATTG AGCGTTTCCG CGTGTGGCAG ACCGATGTCA CCGGTGGTCT CACGCTCGGC AACGAGGGGG TCGCCGGACA GCTTGCGCTG GTGGGCGGCG GCGTCAACGG AACGGTCGCG CTTGTGCCCC GAGACGGCGG CCAGGGCTTT GATGCCAACC TGACCGCGCG CAATGCTCGG TTCGGCGGGA CGCGGCCGCT GTCGATAGGC AACGCCAAGG TCGACGCCAC CGGTCTCATC AAGGACGGGC ACTCGACAAT CGAGGGCAAT GTGCTGGCCG AAGGCATCGG CATGGGCAAG ATCTTCATCG GTCGGCTGGC GGCGGCGGCC TATGTCCAGG ATGGCAGCGG ATCGGTCACG GCCTCGGTGT CCGGCCGGCG TGGGACGCGG TTTGCCTTGC AGGGCACAGC GGCGTTTGCC CCCGACCAGA TCGTGACATT CGTCTCGGGC GAGTATGCGG GGCGGAGCGT CACCATGCCC CGGCGCGCGG TGTTGACGCG GGAAGGGCCG GGCTGGCGCC TGGCGCCCAC GCAGATCGGT TTCGGGCGCG GGATACTGAT TGCGGAAGGC CACATTCTTG GCGGTCCTAC GCAATTGCGC CTGCTGATGT CGAAGATGCC GCTTTCGGCC GTCGACATCG TGGTGGCCGA TCTCGGACTG GGCGGCATCG CTTCGGGCAT CGTCGAATAC AACAACGATG GAAAGGGCGC GCCTTCGGGC AATGCTGCGC TCATCGTGAA GGGCCTTTCG CGCTCGGGCC TGGTCCTGAC CTCGCGGCCG GTGGACCTCG CGCTGGTAGC GCGGCTCGAT CCCGATGCCC TGCAGACGCG GGCGGTGATC CGCGAGGGCA ACGAGGTGCG CGGGCGGTTC CAGGCGCGTA TCGGAGGCCT TCCGCGCGGC GGGGGCTTTG TGGACAGGCT TCAGGCAGGG CAACTGGCTG GGCAGCTTCG CTATTCGGGG CCGGCAGACG CACTGTGGCG CCTGACCGGT GTCGAGGTGT TCGACTTGAC CGGTCCCTTA GGCGCGCGGG CTGACATCTC GGGGAGCATT GGCGCTCCCG TCCTGCGAGG CGCGGTGGCA TCCAAGGGGA TGCGTGTGCA GAGCACGCTT ACCGGTACCG ACTTGCGGCA GGTGGAACTG GCGGGGACGT TCACCGACTC CACTTTGCAA CTGGCGCGCT TCAGCGGCGT CACGCCCAAT GGCGGCCGGG TGAGCGGCAG CGGCACGATC GGCCTTGCCG ACCTTGACCA GCATGGGCCT TCGATTGACC TCAAGCTTTC GGCGCAGAAC GCGCAGCTTA TCAATCGCGA CGACATGGCG GCAGCCGTCA CCGGGCCGTT GCGCATCGTC AGTTCAGGCG TTGGCGGCAC CATCGCGGGA CGTGTGCGGA TCGAGCGTGC GCGCTGGGCG CTGGGCCGGG CAACCGCGGC GCGGGAACTG CCGAACATCG CCACGCGCGA GATCAATGCG CCAGCCGATG CCGCCCCGGC CCGCACGCCG GCAGCGCCGT GGCGCTTCTT GATCGATGCG AGCGGCGCGA ACCTGATCAA CGTGCGGGGA CTGGGCCTCG ACAGCGAGTG GGGTGCCGAC ATCCGTCTGC GCGGCACGAC CGCTGCGCCC CAGATCTTCG GGACGGCGGA CCTGGTGCGC GGCGGCTACG AGTTCGCGGG CAAGCGCTTC GAACTGACGC GCGGCCGGAT CCGCTTCACC GGCGAAGTGC CGGTCGATCC GCTGCTCGAC ATCGTGGCTG AGGGCGATGC GAACAACATC AGCGCCAAGA TCACGATCAC CGGCACTGGC AACCGGCCGA TCATTGCGTT CTCCTCGACC CCGTCGCTGC CGGAAGAGGA ATTGCTGAGC CGCATCCTGT TCGGCAGCTC GATCACCCAG ATTTCCGCCC CCGAGGCGGT GCAGCTTGCA TCGGCGCTCG CTTCGCTGCG CGGGGGTGGC GGGTTGGACC CGATCAACAA GCTGCGCGCG GCCATCGGGC TCGACCGCCT GCGCATCGTC GGTGCCGATC CGACTGTGGG TTCGGGTACG AGCATCGCGG TGGGCAAGTA CATAGGCCGT CGCTTCTTCG TCGAACTCGT GACCGATGGC GGAGGCTACA GCGCGACCTC GGTGGAATTC CGCATCACGC GCTGGCTTGC GCTGCTCGCC ACGATGTCGA CCATCGGGGA CGAGAGCATC AACCTCAAGG CGAGCAAGGA CTACTGA
|
Protein sequence | MAEDAVLPDE PRRPERSRRS RIVRGIARRG AILLVATIAM ILAALVVLDS SLGHRLVADR IAALAPGSGL RIEIGRIDGS IYGAAKLRDI RVSDPEGVFL TVPEAELDWR PLSWLKTGLD VRLLALHRGT LRRAPRLRPS EDTNQPILPD FDIRVDKLVV DNLTVSEALA GRKRRVDVVA KADIRGGHAI VNVNGVLGGK DRVVFVLDSE PDRDKFDLRL AYDAPSDGVI AAMMGAKKDV RARVFGRGGW SSWNGIAYAT QDGQRLAAFQ LEKHKGAYRL AGQAWPGDLL KGTSGRAVGP ALSLLFDGTF ADRVLDGRLR AAGAAFKLAT DGGLDLGSNA ANDLKVKARI LRPELLLASP QLSGVALDAT LDGALKELSI EHVVTVERMK LGTLDAQGLR TAGTATWDGA RFTLPLAVTA RRVVTGNALV DPRFAGGRLT GDLVFAGNRL TSENLSLALN GLGARLVLRG DIARGGYALA GPVAARGLAV PNLGTVDGNA KIVFKIGSGV PWTLQANVAG RMTRISNGTL QTLTGGGLRF AGGAALGERI PVTFRKTTIN SNKLQMTLDG KVLPGGAASL TGSGRHTDYG AFTVEAAMTG SGPNAVLVFA SPLPAAGLKD VRVALSPIAE GFRIETDGQS TFGPFNGALG LFMPQGGATR IDIERFRVWQ TDVTGGLTLG NEGVAGQLAL VGGGVNGTVA LVPRDGGQGF DANLTARNAR FGGTRPLSIG NAKVDATGLI KDGHSTIEGN VLAEGIGMGK IFIGRLAAAA YVQDGSGSVT ASVSGRRGTR FALQGTAAFA PDQIVTFVSG EYAGRSVTMP RRAVLTREGP GWRLAPTQIG FGRGILIAEG HILGGPTQLR LLMSKMPLSA VDIVVADLGL GGIASGIVEY NNDGKGAPSG NAALIVKGLS RSGLVLTSRP VDLALVARLD PDALQTRAVI REGNEVRGRF QARIGGLPRG GGFVDRLQAG QLAGQLRYSG PADALWRLTG VEVFDLTGPL GARADISGSI GAPVLRGAVA SKGMRVQSTL TGTDLRQVEL AGTFTDSTLQ LARFSGVTPN GGRVSGSGTI GLADLDQHGP SIDLKLSAQN AQLINRDDMA AAVTGPLRIV SSGVGGTIAG RVRIERARWA LGRATAAREL PNIATREINA PADAAPARTP AAPWRFLIDA SGANLINVRG LGLDSEWGAD IRLRGTTAAP QIFGTADLVR GGYEFAGKRF ELTRGRIRFT GEVPVDPLLD IVAEGDANNI SAKITITGTG NRPIIAFSST PSLPEEELLS RILFGSSITQ ISAPEAVQLA SALASLRGGG GLDPINKLRA AIGLDRLRIV GADPTVGSGT SIAVGKYIGR RFFVELVTDG GGYSATSVEF RITRWLALLA TMSTIGDESI NLKASKDY
|
| |