Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3983 |
Symbol | |
ID | 5077513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009426 |
Strand | + |
Start bp | 149792 |
End bp | 152833 |
Gene Length | 3042 bp |
Protein Length | 1013 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640481089 |
Product | hypothetical protein |
Protein accession | YP_001165751 |
Protein GI | 146275590 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0507] ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member |
TIGRFAM ID | [TIGR02686] conjugative relaxase domain, TrwC/TraI family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACTCGA TCGCCTCCGT CCGGTCGTCG AGCGGCGCTG CCGACTACTT CGCCAACGAC AACTACTACT CGGCCGATGA GCACGCGGAA GCGGGGGTCT GGGGCGGCGA AGGTGCGCGC GCGCTGGGGC TTGAGGGGCA GGTCGAGCGC GATGCGTTCG AGGGCGTGCT CAATGGGCGG CTGCCCGACG GCGAGATGGT CGGGCAGGTC GAAGGTCGGC GCCTGGGTCT CGATCTCACC TTCTCGATGC CCAAGTCTGC CTCGATCCTG GCGCTGGTCA GCGGCGACCG GCGGATCATC GATGCGCACC TGGCGGCGGT CAAATCGACC ATGTCGCAGC TTGTCGAAAA GCAGTTCGCC GAGAGCCGCA ACTATGAGCG CAGCCGCAGC GGCGAACCCC AGAAGACCGG TAACCTCGTC TATGCCCTGT TCGCCCACGA TACGAGCCGC GCGCTCGATC CGCAGGGGCA TATCCACGCC GTCGTCGCCA ATCTGACTCG CGATCCCAAG GGCACCTGGA AGGCGCTGTG GAACGGCGAG ATCTGGAAGA ACAACACCAC GATCGGCCAG TTCTACCACG CCGCCTTCCG CGCCCAGTTG CAGAAGCTCG GCTACGAAAC CGAGGCTGCC GGCAAACACG GGTCGTTCGA GATCAAGGGC GTGCCCGCCG AGGTGATCAA GGCCTTCTCG ACCCGCACCA CCGAGATCGA GGCGAAGATC GCCGAGGTCG GGGCAACCCG CCTCGAAACC AAGAAGCAGA TCACGCTTTA TACCCGCGAT CCCAAGCTGG CGGTCGAGGA TCGCGGCGCA CTGGTCGAGG GCTGGCAGCA GCGCGCGGCC GAGCTCGGGT TCGACGGCAA GGCTCTTGTT GCCGCAGCCA AGGCGCGCGC CGAAGTCGAG GCGCGGCCGA GCTTCCGCGA GACGGCCACG GCCGCGATCG GCGAAGTTTC GACCCGGATC AACGCCGCGC TGCGCACCCC AAGTCCGCTT GCCGTGAGCG GGGCCGCAGC ATTGTTCCTG TCGGCCGCCA CCATCAAGGC TCAGCACGCC ACCGCTTCGG CGATCCGTCA TCTCTCCGAG CGCGAGGCAG CGTTCTCGCC GCAGGCAATC CTGGCGAGCG CGCTGGGGTT CCATATCAAG GGCCTCGAGG GCGGCGCAGT GGTCCAGCGA ATTGGCGAGC TCATTCGCGA CGGGCACCTG ATCCCCGGCA AGTCGGACCG GCTCGACGGG CACTCCGATC TCGTGACCAC GCCGGCCGCA CTGGCCATGG AGCAACGCAT CCTTGACACC ATCGAGCGGG GTCATGGCGA GGGCCGCGCC TTCATGCCGC CCGAGGCGGC GATGACCCGG CTGCAGGAGG CCGCGCGTGA ACTTGGGCGC GAGCGTGCCG GGGTGGACAC CTGGCAACTC AACGAAGGCC AACTCGCGGC GGGCGTGGCG ATCCTCTCAG GCGGCGATCG CTTCCTCAAC GTCCAGGGCG TCGCCGGTGC GGGCAAATCC ACCCTGCTAG GCGCACTCGA CAAGGTGCTG GACGCGGAAG GCGTGAAGCT TGTCGGGCTC GCTTTCCAGA ACAAGATGGT CGCTGATCTG CGCGGTGGCG GCGGTCAGGG CATGTCGGCC GACCAGATGC GCGAGGCGGG GATCGAAGCC TATACCATCG CCCGGTTCCT CAATACCTAC AGCTCGGCGG CCGCTGCAGG CAGCGGCGAG CGCTACGAGG CAGCGAAGGC TGCGCTCGCC AATACCGTCA TCATCACCGA CGAAAGCTCG ATGGTTTCCT CGCGCGACAT GCTGCGCCTG GTGACGCTCG CCGAGCAGCT CGACCTCGCC AAGGCGCCGT TCATGGGCGA CCGCCAGCAA TTGTCGGCGA TCGAGCAGGG CAAGATGTTC GCGGTCTCGC AGGCCTCTGG TCAGGCGACC GTGCGGATGG ATGAGAACAT ACGCCAGAAG AACTCGCCGC TTCTCCTCGC AGTTGCGGGG CTCTCGAACG AGGGCCATGC CGGCCTCGCA CTTGACCTGC TTGCCGCGCA CGGCCGCGTA ATCGAAGCAG GGCCCGACCA TGTCGCCCGC GCCGCCGAGC TGTGGCTCTC GCTCGAACCC GACAAGCGCG AAGCGACCGC GATCTTCACC GCCGGGCGTG ACGATCGCGC CGAGATCAAC GCTCGGGTCC AGGCGGGGTT GCTCAAGGAA GGGACATTGA GCGGCGCGGG CGTGCCGCTG GCGACCCTCC AGAGCGTCAA CGCCACCCGC GAGGAGCTGC GGTTCGCCTC GACCTACCGG GTGGGTCAGG TGCTTGAGGC GCGGATGGAG GTGCGCGAGA TCGGCCTCAA GGCCGGCGAA TACCGGGTGA GCGAGGTGCG CAAGGACGGC AAGGTCGTGC TCGAGCGCGA TGGCAAGCGC AAGGTCATCG ATCCTGACCG GATCAATCCC GATCACCGCT TCGACCGGCT CGGGCTTCAT GAAGAGAAGC AGATCCGCCT CCACGACGGC GAAACCGTGT TTTGGCGTGA CAAGGACGGG TCCCGAGACA TTGCCAAGTC GACCTACGCC ACCGTGCTCG AGGCGCGAGC AGAAGGCATC CGGGTCGAAC TTGCGGACAA GCGGCAGCTT GTCCTGGTCC CCGGCGATCC GATGCTGCGC CGTCTCGACC TTGGCTATGC GCTCAACGCT CACATGGCGC AGGGCATGAC CAAGCCACAG GCGATCGAGG TCATCTCTTC GACCCAGCGC AACCTTGCCA CCCAGCGCAC GCAGAACGTC CTCAATACCC GCGCCACTGA CGACATGACC GTCGTCACCA ACAACCTCGA AGGCCTGAAA CAGCAACTCG ACCGCACGCC CGGCAACAAG ACCTCAGCGC TCGAAGTCAC CGGTAAGGTC GAGGTTGAGC CGCGCCATGT GAACCCGATC GATACCCGGC AGGTGCCCGA ACTCCGCATG AGCCCCGAGC TCAAGGCCAA GCTCGACGCA GTGCTCGGCA AGGTGGCGGA GGCGCCGGTG AAGCAGATGC GCCCGGAAAA GACGCTGGGG CTCGACCTGT GA
|
Protein sequence | MHSIASVRSS SGAADYFAND NYYSADEHAE AGVWGGEGAR ALGLEGQVER DAFEGVLNGR LPDGEMVGQV EGRRLGLDLT FSMPKSASIL ALVSGDRRII DAHLAAVKST MSQLVEKQFA ESRNYERSRS GEPQKTGNLV YALFAHDTSR ALDPQGHIHA VVANLTRDPK GTWKALWNGE IWKNNTTIGQ FYHAAFRAQL QKLGYETEAA GKHGSFEIKG VPAEVIKAFS TRTTEIEAKI AEVGATRLET KKQITLYTRD PKLAVEDRGA LVEGWQQRAA ELGFDGKALV AAAKARAEVE ARPSFRETAT AAIGEVSTRI NAALRTPSPL AVSGAAALFL SAATIKAQHA TASAIRHLSE REAAFSPQAI LASALGFHIK GLEGGAVVQR IGELIRDGHL IPGKSDRLDG HSDLVTTPAA LAMEQRILDT IERGHGEGRA FMPPEAAMTR LQEAARELGR ERAGVDTWQL NEGQLAAGVA ILSGGDRFLN VQGVAGAGKS TLLGALDKVL DAEGVKLVGL AFQNKMVADL RGGGGQGMSA DQMREAGIEA YTIARFLNTY SSAAAAGSGE RYEAAKAALA NTVIITDESS MVSSRDMLRL VTLAEQLDLA KAPFMGDRQQ LSAIEQGKMF AVSQASGQAT VRMDENIRQK NSPLLLAVAG LSNEGHAGLA LDLLAAHGRV IEAGPDHVAR AAELWLSLEP DKREATAIFT AGRDDRAEIN ARVQAGLLKE GTLSGAGVPL ATLQSVNATR EELRFASTYR VGQVLEARME VREIGLKAGE YRVSEVRKDG KVVLERDGKR KVIDPDRINP DHRFDRLGLH EEKQIRLHDG ETVFWRDKDG SRDIAKSTYA TVLEARAEGI RVELADKRQL VLVPGDPMLR RLDLGYALNA HMAQGMTKPQ AIEVISSTQR NLATQRTQNV LNTRATDDMT VVTNNLEGLK QQLDRTPGNK TSALEVTGKV EVEPRHVNPI DTRQVPELRM SPELKAKLDA VLGKVAEAPV KQMRPEKTLG LDL
|
| |