Gene Information |
Locus tag | Saro_3948 |
Symbol | |
ID | 5077432 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009426 |
Strand | - |
Start bp | 123039 |
End bp | 125348 |
Gene Length | 2310 bp |
Protein Length | 769 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640481054 |
Product | hypothetical protein |
Protein accession | YP_001165716 |
Protein GI | 146275555 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03187] DGQHR domain |
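
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions, that Gene Length counts the stop codon, and that Protein Length does not; these assumptions are not stated in the record, but they are consistent with its numbers. The function name `check_cds_lengths` is hypothetical.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if the coordinate span, gene length, and protein length agree.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, the gene length includes the stop codon, and the protein
    length excludes it.
    """
    span_bp = end_bp - start_bp + 1       # inclusive span on the replicon
    if span_bp != gene_length_bp:
        return False
    if gene_length_bp % 3 != 0:           # an unspliced CDS is a whole number of codons
        return False
    codons = gene_length_bp // 3
    return codons - 1 == protein_length_aa  # drop the stop codon


# Values copied from the Saro_3948 record.
print(check_cds_lengths(123039, 125348, 2310, 769))  # True
```

Here 125348 - 123039 + 1 = 2310 bp, which is 770 codons, leaving 769 residues once the stop codon is set aside.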
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0773653 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCTGATG ACGTAATGGG CCCGACCGTC ACGGGGGATG ATCTGGATTC TGAGCTGCGT CAGCGCAAAA GCAAGGACAT CTTTCACACA GTTACGGGTT CAACGCGCAA GGTCATTGCC GAGAAAGTTG CCCTTGAAGA GCAAGACGGG TGGCGCGTCG CAAAGAAAAA CAAGAAGTCG ACGAGGCTTG CGAAGGCCAA GCCTGCGCAC GAACAGCTCG AAGATGAGGT TTGGTGCCTC CTCGCTCAAA TGGGATTCCA GGAGCTGAAC AAGGGCAGGC TCTTCACGAT CGCAGTGGAG GATGGCCTGA ACCCTCGGCA GATCGATGTA TTTGCCAAGG ACGATGAGAC GGTCATCATT GTTGAATGTC GGCAGAAAGC GACAGTCGGG CGGAAGCCAA TGGCGGATCT TATCGAAAAG ATCCGTGCGC TGCGCGAGGC CGCGCAAAAA AGTATCAAAC TCCATTACGG CAGTCAGAAA AAGCTGAAGG TCAAATTCGC CATTGCCACC AGAAACATCA TCTGGGGCGA GGCTGACCTG GAGAAATGTA AAGAGTACCA GATCGCTGTG ATATCGGACC AGCTGCTCGA TTATTACAAG CAACTGACAC AGCACCTCAA AATGGCCGCG CGCTTCCAGT TCCTCGCGCA CATGTTTGAG GGACAGCGGG TCGATGGGCT CGCGCAAACC GTAGTTGCGA CCCGAGGCAA AATGGGTGGA AGGCCGTTCT ACACGTTCCT GATCCGGCCC GAAGAGCTGA TGAAGATCGC CTATGTTGGC CACAAGGGTA GCCGCGACAT CGAAAACCTC GAAACCTACC AGCGAATGCT CCAGTCCGAC CGACTGAAGG GAATTGCGAA GTACATCAAT GAAGGCGGCA AATTCCCGAC TAACATCGTC GTGAACCTCA AGCTCCCCGG TAAGAAGGAA CCACAGTTCG ACAAGAAGGA GACCGTCGGT GAGGAGATAC TCGGTTTCTT GCACCTGCCC CCCATCTATG CCTCAGCATG GGTGATTGAT GGGCAGCACC GCCTCTATGG CTATGCCTAT GCGCGTGAGA ACGGAGGCTT CAAGAGCGAC GAGACCGTTC TACCCGTGCT GGCATACGTC AATCTTCCCG CCGATGAAGA GATGGACCTG TTCATCGACA TCAACAGCAA GCAGGTGAAG GTGAAAACCG GGTTGCTGGT CGAACTCTAT TCGGATCTGC ACTGGAAGTC CGACGATGTC GAAGAGGCCT TCCAGGCCCT GCTGTCGCGG ATCGCCTACC GGCTGAACAA GGACAAGGCT TCTCCACTCT TTGACCGCAT GGTTGTCTCC GGCACCAGGA AAACGAATGT GCGGTGTCTT ACGCAGACGT CGATACGGGA CGGCCTCAAG GTTGCCCGGC TGATTGGCAG TCCCCTTAAG GGCATGATCG TGCCCGGTCC CCTCTCTACG GGTGATCCGC TCAACTACGA CGCCAACCTC AAGAAGAGCC TTTCGGTTCT GACTGAGTGT CTCGCGCTTT TCGCAAACAT TTTGCCCAAC CAATGGGCGG CCGGCGACAG CCCTACTGGC TATGTATGCA CCAACAACGG TCTTCGCGCC CTGTTCTTGG TGATCCAGGA TGTCGCAGAG CATGTCCGTC AAAATTCGGG CATAGACCTC GCGCTACTCA ATGCAGATGA AACGTTCAAA GAACTGGAGC CGTATCTTAC AGCCCTCGCA GATCAACTCG CATCGGTAGC GCCGAACGAT ATTCAGGCAT TTCGCAAGAT CGGATCCTCA CTGACAGCTG TGAAGCAGCA GTCGTTTGGC 
ATGGAAGCCT ACATTCAGGC GAAACTCTCT GATTTCCGCC CGCTGGGGCT CCAGGAATAC CTGGCCTCGC GCGATGCAGC TGGCGCCGAT GCGGCGGCGG CGAAGGTGAC CCAAATCCAC AAGAAGCTGT TCAACTACGT CATCGAGACT CTGAAAGATC ACTTTGGCCG GGATCACAAA GCGTGGTGGA CCCAGGGCGT ACCGCTTACC ATTCGCCTTT CGTGTACCCA GGAATGGGAA AAGAAGAATC GTGAAGGCGA CGAGGAGTCT CATCTCTACC TCATCAACTA TCAGGATATC GCCGTCGCCA ACTGGGATCT GTTCCGGGAC ACCCTGTCCC TGGGTTATAA GGATCCGGAC AACAAGAAAG AGAGCACCAA GTGGATTAAA GTGCTCAACG ATATCCGCCA ATATACGGCT CACCCTGAAA AAGGCCTGCT CAGCAAGGAA CAGGTCTCAT TCGTGAATGA GGTTTACGAG AAGGTCGAGC ATCATATTCC CGCCCGGTAG
Protein sequence | MADDVMGPTV TGDDLDSELR QRKSKDIFHT VTGSTRKVIA EKVALEEQDG WRVAKKNKKS TRLAKAKPAH EQLEDEVWCL LAQMGFQELN KGRLFTIAVE DGLNPRQIDV FAKDDETVII VECRQKATVG RKPMADLIEK IRALREAAQK SIKLHYGSQK KLKVKFAIAT RNIIWGEADL EKCKEYQIAV ISDQLLDYYK QLTQHLKMAA RFQFLAHMFE GQRVDGLAQT VVATRGKMGG RPFYTFLIRP EELMKIAYVG HKGSRDIENL ETYQRMLQSD RLKGIAKYIN EGGKFPTNIV VNLKLPGKKE PQFDKKETVG EEILGFLHLP PIYASAWVID GQHRLYGYAY ARENGGFKSD ETVLPVLAYV NLPADEEMDL FIDINSKQVK VKTGLLVELY SDLHWKSDDV EEAFQALLSR IAYRLNKDKA SPLFDRMVVS GTRKTNVRCL TQTSIRDGLK VARLIGSPLK GMIVPGPLST GDPLNYDANL KKSLSVLTEC LALFANILPN QWAAGDSPTG YVCTNNGLRA LFLVIQDVAE HVRQNSGIDL ALLNADETFK ELEPYLTALA DQLASVAPND IQAFRKIGSS LTAVKQQSFG MEAYIQAKLS DFRPLGLQEY LASRDAAGAD AAAAKVTQIH KKLFNYVIET LKDHFGRDHK AWWTQGVPLT IRLSCTQEWE KKNREGDEES HLYLINYQDI AVANWDLFRD TLSLGYKDPD NKKESTKWIK VLNDIRQYTA HPEKGLLSKE QVSFVNEVYE KVEHHIPAR
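
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (it differs in which codons may act as starts), so the sketch builds the standard table. The names `CODON_TABLE` and `translate` are hypothetical. Only the first three 10-base blocks and the final 12 bases of the gene sequence are used here, not the full 2310 bp.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
# Table 11 shares these amino acid assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the spaces between 10-base blocks) is ignored,
    and a trailing incomplete codon is skipped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGCTGATG ACGTAATGGG CCCGACCGTC"))  # MADDVMGPTV
# Last 12 bases of the gene sequence above, ending in the TAG stop codon.
print(translate("CCCGCCCGGTAG"))  # PAR
```

Both outputs match the first ten and last three residues of the protein sequence listed in the record.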