Gene Information |
Locus tag | Saro_2175 |
Symbol | |
ID | 3918840 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2317040 |
End bp | 2320144 |
Gene Length | 3105 bp |
Protein Length | 1034 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640444930 |
Product | hypothetical protein |
Protein accession | YP_497448 |
Protein GI | 87200191 |
COG category | [R] General function prediction only |
COG ID | [COG1483] Predicted ATPase (AAA+ superfamily) |
TIGRFAM ID | |
|
|
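The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is only an illustration: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon (assumed present) does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (Saro_2175).
length_bp = gene_length(2317040, 2320144)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the sketch reproduces the listed gene length (3105 bp) and protein length (1034 aa).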
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0743489 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTATGG GTTACAATAT TCCTTTGGTC ACCTCTCTCT GCGACATCAG CCCGGACGTG TTCTCGATGA GCCGATCCGA GCAAGTGGAG CACCTCTCAT CGATTGGCGA CGCTGATATT TCGACTGCGC GTGCCTTCTA CGCTCGCAAC CACGTTACAA GCGGAATGTC CGAGTTCCTG CGTGGCGCTA TGCGACGCCT GTCCGGTCAA AGCCAACAGG CGGTCTTCGA GCTTCGTCAG GCGATGGGCG GCGGTAAGAC CCACAACATG ATCGCCCTCG GCCTCTTGGC TCGGTTTCCG GAGTTAAAGG ATCAACTTCC AGACACGATC ACGGCCGGGA TGGGTGACGA GCCTGCTGTA ATCGCCACCG TCAACGGGCG GGACGTTCAG AACTTCATTT GGGGGGACAT CGCCGAGCAG CTTGGTCGGG CGGAGGCGTT CCGGGACCAC TGGGTCAACG GGCCGAAGGA GATGAACGAG GGGGATTGGA TCCGGCTCAT AGGGGACCGG CCCACCCTGA TCATGCTCGA CGAGCTCCCC CCGTACCTCG CCATGGCGCA CACAAAAACG GTCGGTCAGG GGACCCTGCT GGACCTTCTA AAATACTCAA TCGCAAACCT TTTCTCCGCC GCGATGAAGC TCAAACGCTG CGTGGTCGTG GTCGCCTCCC TCGACGCCGC CTATGACGAG GCGCGCCGGA TCTTAGGCGG GCAGCTCGCG GACCTCCAGA AGGAGACGAG CCGCGGCGCG AAGTCGATCA CCCCGGTCGA CCTGAACACG GGCGAGATCT ACGACATCCT GCGCAAACGC CTCTTCACCC GACTCCCGGA CCCGGACGGG GCAGAGGTGG ACAGAGTGGC GCAGGCCTAT CTTGCCGCCT ACCAGGAGGC GATTCGGGGC CGGGCGCTAG CGAAATCTGC CGAGCAGATG GCTGACGAGA TCGTCGCCTC CTACCCGTTC CACCCGAGCT ACAAGGACAT TCTCTCCCTT TTCAAGGAGA ACGAGAAGTT CCGTCAGACC CGGGGCTTAA TCCAGTTCAC CGCGAACCTG ATGCGCGGGG TCTGGGGGAG AAAGGACCAA GAGGAAGTGT TCCTGATCGG GGCGCAGTTC CTCGACTTCT CCGACCAGGA GACCCGAGAT CAGGTAAAGG AGATCGAGCG GTCGCTGGAG TCCGCTCTAG CCAGCGATGT TTATGACACG GACGGATCCG CGCACGCTCA GGGCATCGAC GGGGACCGGA ACGACCGCGC CGCGTCGCAG GTGGCCACCC TCCTGTTCAT CTCGTCGCTG TCGGACAACA CAGACGGGAT CCGGGGCCTG CCCCGCGACA CAGTCGTGGA GTACCTGGTC TCCCCCGGCC AAGAGCCGAC GCGGTTCATC GAGGCGTTCG ACCAGCTTCG GGACCGTTGC TGGTACCTGC ATAATCGGGA CGGGGATCGG TGGTACTTCT CCGACATCGC CAACGTCCGC AAGCAGATCG AGGACAAGGT CGGAAAGATT CCGCAGGACC GGGTCGACGA GGAGATGCGT CGCCGCCTCA CGGACATCTT CCGTCCCGTC AACAAGCTCG CCTACTCTGA GCTGGTGGTC CTTCCGCGGG TCGATGAAGT GAACCTGACC CCGTCTAAGC GGACCTGTCT GGTCCTCTCC CCCGACGCGA AGTCCCCGCC CGCTGCGGCA GCGCGGTTCT TCGACGACGT GGTTTACAAG AACGCGTTCT GCGTGGTCGC CGGGGACGGG TCGAAGATGG CCAGCGCGGA GGACAGCGTC CGTCGCCTCC TTGCGATCGC TGCCGTGAAG 
ACGATCGTGG CGGACACCCC CCGGCACCAG CGGGAAATTG AGACGGAGCA GGAAACGACC GAGATTGGGT TCAACTCCAC CGTCAAGAGC CTCTTCAACG CGGTTTGGTA CCCGCAGACG AAGGAGCTGA AGAGCGCCCG GATCGACCTC GGCCACTTCC AGGAGCGGGG AGTGATCCAA GGGGAGAAGG CGGTTGAGGC GGCTCTCGCG GGAGGAGGGG CAAAGAAGCT CGTCGAGCTG GACCCGGAGA AAACGGACGG GCTGATCCAG CGCTGCGAAG ACCAGCTGTT CCCGGAGAAC CAGTCCCGCA CCCGCTGGTC GGACGTGCTG GAGCGGGCCG CGTCGAACCC TCGCTGGATA TGGCTCCCCC CGAAAGGAAT GGAGGAGATC AAGGCAGCCG CCCTCGCGGA GGGCCGCTGG GTCGAGGAGA ACGGCTACGT AGACAAGAGC CCGCCCCCAC CACAGCCACT GATCAGGGTC ACTCGGATCG GCGGGGACGA GGCGACCGGG GAGAGCGAAC TGGAGCTCGC CGTCTCGAAC GCCGGGCGGG CACCCGAAGT GCTGGTTGCA CCGACCCGGG AGGGGCTGGA CGCCGGTGAG ATCATCACGG ACCGAACCTA TCGGACCACG GAGGTGGAAC TCTGGTTCCA GGCCCGTAAC CCGGAAACAG GGGAGGTGAG CGAGCCGTAC CGCTGGGCCG GGTCTATTAC GATCACTCAT GAGCGGCGTG ATAACGCGGG CATGTGGCAG GTGACCCTCG CAGCGCGTCC CGAGGCGGAG CTGCGCTGGA ACACCTTGGG GATCAACCCG AAGGACGGGT CGCTCTATAA TGGCGGAGCG ATAGAGATCG ACGGGAGGCA AAAGACCACT CTCTACGTCT ACGCGGTCAA GGGGGGCGTG TCCGCGCAGC GGACCTTCGC CTTCGACGCG GTCGGCGCAC AGCGAACCAT CAACAACGAG CGCCCCGCCA AGGCGAAACG TGACTTCCAG TTCGCCTCCA AGGGGGAGGT TCTCCGGGTC GTGCGGGCCG CGAAGGGGAA GGAGAGCGTC CGGTTCCACG GGGTCAGCGT CACTGTTGGT GAGGGGGAGC GGAGCCTCCG GGTCCGCAGC GGCGGGGACG TTGCCCTGTC GGGTGCGGAC ATCGAGACGA TGATCGACGG GCTGCGCGGT GCGCTCGGGC AGCCGGACGC GGAGGTGCAA CTCCGGTTTC GGGAGGCGGA CTTCCCTGAT GGGTATGCCC TGAAGGACTT CGCGACCCAA GTCGGGATCG ACATCCCGGT TGAGGACGTG GAGCAGGAGG CCTGA
|
Protein sequence | MSMGYNIPLV TSLCDISPDV FSMSRSEQVE HLSSIGDADI STARAFYARN HVTSGMSEFL RGAMRRLSGQ SQQAVFELRQ AMGGGKTHNM IALGLLARFP ELKDQLPDTI TAGMGDEPAV IATVNGRDVQ NFIWGDIAEQ LGRAEAFRDH WVNGPKEMNE GDWIRLIGDR PTLIMLDELP PYLAMAHTKT VGQGTLLDLL KYSIANLFSA AMKLKRCVVV VASLDAAYDE ARRILGGQLA DLQKETSRGA KSITPVDLNT GEIYDILRKR LFTRLPDPDG AEVDRVAQAY LAAYQEAIRG RALAKSAEQM ADEIVASYPF HPSYKDILSL FKENEKFRQT RGLIQFTANL MRGVWGRKDQ EEVFLIGAQF LDFSDQETRD QVKEIERSLE SALASDVYDT DGSAHAQGID GDRNDRAASQ VATLLFISSL SDNTDGIRGL PRDTVVEYLV SPGQEPTRFI EAFDQLRDRC WYLHNRDGDR WYFSDIANVR KQIEDKVGKI PQDRVDEEMR RRLTDIFRPV NKLAYSELVV LPRVDEVNLT PSKRTCLVLS PDAKSPPAAA ARFFDDVVYK NAFCVVAGDG SKMASAEDSV RRLLAIAAVK TIVADTPRHQ REIETEQETT EIGFNSTVKS LFNAVWYPQT KELKSARIDL GHFQERGVIQ GEKAVEAALA GGGAKKLVEL DPEKTDGLIQ RCEDQLFPEN QSRTRWSDVL ERAASNPRWI WLPPKGMEEI KAAALAEGRW VEENGYVDKS PPPPQPLIRV TRIGGDEATG ESELELAVSN AGRAPEVLVA PTREGLDAGE IITDRTYRTT EVELWFQARN PETGEVSEPY RWAGSITITH ERRDNAGMWQ VTLAARPEAE LRWNTLGINP KDGSLYNGGA IEIDGRQKTT LYVYAVKGGV SAQRTFAFDA VGAQRTINNE RPAKAKRDFQ FASKGEVLRV VRAAKGKESV RFHGVSVTVG EGERSLRVRS GGDVALSGAD IETMIDGLRG ALGQPDAEVQ LRFREADFPD GYALKDFATQ VGIDIPVEDV EQEA
|
| |
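The sequences above are printed in space-separated blocks of 10 characters. A minimal sketch of how such a field might be cleaned up and checked is given below. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the amino-acid assignments of translation table 11, which are the same as those of the standard code. The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments shared by translation tables 1 and 11,
# listed in TCAG order for the first, second, and third codon position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(field: str) -> str:
    """Remove the block spacing and line breaks from a sequence field."""
    return "".join(field.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence field above.
prefix = clean_sequence("ATGAGTATGG GTTACAATAT TCCTTTGGTC")
print(translate(prefix))
```

Applied to the first three blocks of the gene sequence, the translation agrees with the first block of the protein sequence (MSMGYNIPLV). Running `gc_fraction` on the full cleaned gene sequence would give a value to compare against the GC content field.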