Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1998 |
Symbol | |
ID | 3917318 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2129827 |
End bp | 2132808 |
Gene Length | 2982 bp |
Protein Length | 993 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640444749 |
Product | helicase |
Protein accession | YP_497271 |
Protein GI | 87200014 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3893] Inactivated superfamily I helicase |
TIGRFAM ID | [TIGR02786] double-strand break repair protein AddB, alphaproteobacterial type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.138335 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGACG CCCGCGCCCG CCCGCGCGTC TGGTCGATCG CGGCGCATCG CGGCTTTGCC GACGCGCTCG TCGCGGGTCT CGTCCCGCGT TATCGCGAGG ATCGCTTCGG CCTCGCCCGG CTGACGCTTC TGCTGCCCAG CCAACGTGCC GTGCGCACGG TGACCGAGGC ATTCGTGCGT GCCAGCGGGG CTGGCCTGCT GCTGCCGCGC ATGACCGTCG TCGGCGACCT CGATCTTGAC GAGACGCTGG GGCCGCTGCT CGATCCCATA GGGGCGGGCG TGGACATGCC CGAGGCTGTC GACCCGGTTT GGCGGCTCCT GCGCATAGCG GGCATCCTGC GCGACGAACT GGGCGAGGAC GCGCCGGGCG AGGCGGCGCT GCTGCGGCAG GCGCGCGGGA TCGCGCAAGG GATCGACCGC CTGCTCGTCG AGGGCGTGCA GCCCGAACGC ATGCTCGACG AGGCGGTGAT CGGGATTGCC GCCGAGCTTT CGGAGCATTG GCAGGAAAGC ACCCGTCTGT TTGCGCGGGT GTTCTTCCGC TGGCGCGCGG AGCTGGAAGC GATTGGCAAG GTCGATGCGC CAGAGCGGCG CAACCGCCTG CTCGACCACG CCGCGCGAAG CTGGCGCGAG AGGCCGCCGG CGCATCCGGT CATCGCGGCC GGCGTGACTT CCGCCTCGCC AGTGGTGGCA AGGCTGTTGC GCACGGTTGC CGACATGCCC GAAGGAGGCG TCGTCCTCCC CGATCTCGAC CTCGCGCTCG ACCCGGAGGT CTGGGACGCG CTGGGGAGCG CGGGTGGGCC TGACGGGGGC CTGTTCGAGC GTGGCGACGT GGTGACGCAC CCGCAGTACC ACCTGAAGCT CCTGCTCAAC CGGATGGGCA TCGCACGCGA CGAAGTGCAG CCCTGGCACC GCGCTGGTCT TGCCGCCGCC CCGCCCGAAC GAAGCCGCGC GATCTCCAAT CTCTTCCTGC CGCCGGAAGC CAGCGCCGCC TGGGTCTCGC TTGAGGCGCG CGAGCGCCGT CTGGCCGGCG TCAGGGTCAT GGAAACCGCG CATCCGGAAG AGGAAGCGCA GGCCATCGCC GTCCTTGTCC GCGAGGCGCT GAAAGAACCG GAGCGCCGCG TTGCGGTGAT AACCCCCGAC CGCAGCCTTG CCGCGCGCAT CGTGGCGCAT CTTGGCCGGT GGAACATCGG CGCCGACGAC ACCGCCGGCC GTCCTCTGCC GCAGACGGCG GCGGGAAGGC TGCTGCTGCA ATTGGCCGAG GTCGTGGCCG AACGCGCGGC ACCGGTGCCG CTGCTCGCGC TGCTCGGCCA CCCGCTCGTG CAGGGCGGGG AAGGGCGTCC GGTGTGGCTG GAGCGCGTGC GCCAGCTCGA TCTGGTCCTG CGCGGGCCGC GCCCTGGTCC GGGCCTGCCG GCAATCCGGC AGGCGGTGGA TAAAACGGCG AAACGCTTCC CGGCGCTGCC CGACTGGTGG TCGGGCGTCG AGGACCTGCT TTTCCCGCTC GTCCCGCTCG AAGGCGCGGT CCCTCTCGAC ATGGCCCTCG TCGCGCTGGT CGAGGCAGGC GAGGCGCTTT GCGGAACTGC CCTGTGGGCG CAGGCAGACG GGCGCAGCCT TGCCGCCTTC GTCGAACGCT GGCGCGATGC AGCAGGCGAT GCGCCGGCCA TGGTCGATGT CGCGGAACTG CCCTCGTTGC TGCGCGATGC GATGGAGGAG ATCTCTGTCC GCCCACCTTG GGGCGGCCAC CCGCGCCTCG CGATCTACGG CCTGCTCGAA GCGCGAATGA GCCGTGCGGA CCTCGTCATC TGCGGTGGGC TGACCGAAGG CACCTGGCCG GGCAGCCCCG CGCCCGATCC GTTGCTGGCC CCGGCGATCC TGCGCGCGCT CGGCATTCCC GGCGCGGAGT TCCGCATCGG CCTGTCGGCT CATGACCTTG CCGCTGCACT GGGCGCACCC GAAGTGGTCC TGAGCCACGC CCGGCGCGAC GCGAGCGGCC CGGTGATCCC CTCTCGCTTC CTGCTGCGGA TCCACGCGAT GCTGGGCGAC CAGTTGCGCA TCGAGGAGCG CGCGGTGGAG CTTGCCAGGG CGCTTGCGGA CGCTGACCGC ATCGCCCCGC ATCCGCAACC GCGCCCGATG CCTTCGGCAG AGCAGCGTCG CGTCCCCATA GCCGTCACCG CGCTCGATCG GCTGCGCGGC GATCCCTACC AGTTCTATGC CTCGGCGATC CTTGGCCTGA GGAGCCTCGA TCCGATCGAT GCCGATCCGA CGCCCGCCTG GAAGGGCACG GCGGTCCATG ACGTGCTCAA GGCATGGCAC GAGTCCGGCG GCGTCCCGGG CCAGCTCGTT CCACTGGCCG AGCGCATGTT CGACGAGATG AGCGCGCACC CGTTCATGCG CACCATGTGG AAGCCGCGCC TTGTGGACGC GCTGCACTGG ATCGAGGAGG AGACGGATCG GCTTGCCGGG GAAGGCCGCG AAGTCCTCGC CGTGGAACGC AAGGGCGAGA TCGTGGTCGA CGGCATCCGC ATCCACGGTC GCGCGGACCG TATCGACCGG CTGCCCGACG GTACGCTCGC GGTGGTCGAC TACAAGACGG GAAAACCGCC TTCGGGCAAG ATGGTGGCCG AGGGCTTCGC CTTGCAGCTC GGCCTGATCG GCCTGATCGC ACGCGGCGGC GGCATGGACG GTGTGGCGGG AGAGCCCACG GCGTTCGAAT ACTGGTCGCT TGGCCGCAAC AAGGAACGCG GCTTCGGCTA CATGAAGTCT CCGGTGAAGG AGACGGCGCG CCAGACCGGC ATCCCGAGGG AAGAGTTTCT CGACCGCACG GAGGACTACC TGCACGAAGC CATAGCGCGC TGGTTGCTTG GATCGGAACC CTTCACCGCA AGGCTCAATC CCGACCTTCC GGGCTATTCT GACTACGACC AGCTCATGCG CCTCGATGAA TGGCAGGGCC GTGAGCGCAA GGGAGGCGGC GGCGAGCCAT GA
|
Protein sequence | MDDARARPRV WSIAAHRGFA DALVAGLVPR YREDRFGLAR LTLLLPSQRA VRTVTEAFVR ASGAGLLLPR MTVVGDLDLD ETLGPLLDPI GAGVDMPEAV DPVWRLLRIA GILRDELGED APGEAALLRQ ARGIAQGIDR LLVEGVQPER MLDEAVIGIA AELSEHWQES TRLFARVFFR WRAELEAIGK VDAPERRNRL LDHAARSWRE RPPAHPVIAA GVTSASPVVA RLLRTVADMP EGGVVLPDLD LALDPEVWDA LGSAGGPDGG LFERGDVVTH PQYHLKLLLN RMGIARDEVQ PWHRAGLAAA PPERSRAISN LFLPPEASAA WVSLEARERR LAGVRVMETA HPEEEAQAIA VLVREALKEP ERRVAVITPD RSLAARIVAH LGRWNIGADD TAGRPLPQTA AGRLLLQLAE VVAERAAPVP LLALLGHPLV QGGEGRPVWL ERVRQLDLVL RGPRPGPGLP AIRQAVDKTA KRFPALPDWW SGVEDLLFPL VPLEGAVPLD MALVALVEAG EALCGTALWA QADGRSLAAF VERWRDAAGD APAMVDVAEL PSLLRDAMEE ISVRPPWGGH PRLAIYGLLE ARMSRADLVI CGGLTEGTWP GSPAPDPLLA PAILRALGIP GAEFRIGLSA HDLAAALGAP EVVLSHARRD ASGPVIPSRF LLRIHAMLGD QLRIEERAVE LARALADADR IAPHPQPRPM PSAEQRRVPI AVTALDRLRG DPYQFYASAI LGLRSLDPID ADPTPAWKGT AVHDVLKAWH ESGGVPGQLV PLAERMFDEM SAHPFMRTMW KPRLVDALHW IEEETDRLAG EGREVLAVER KGEIVVDGIR IHGRADRIDR LPDGTLAVVD YKTGKPPSGK MVAEGFALQL GLIGLIARGG GMDGVAGEPT AFEYWSLGRN KERGFGYMKS PVKETARQTG IPREEFLDRT EDYLHEAIAR WLLGSEPFTA RLNPDLPGYS DYDQLMRLDE WQGRERKGGG GEP
|
| |