Gene Saro_1998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1998 
Symbol 
ID3917318 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2129827 
End bp2132808 
Gene Length2982 bp 
Protein Length993 aa 
Translation table11 
GC content70% 
IMG OID640444749 
Producthelicase 
Protein accessionYP_497271 
Protein GI87200014 
COG category[L] Replication, recombination and repair 
COG ID[COG3893] Inactivated superfamily I helicase 
TIGRFAM ID[TIGR02786] double-strand break repair protein AddB, alphaproteobacterial type 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.138335 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGACG CCCGCGCCCG CCCGCGCGTC TGGTCGATCG CGGCGCATCG CGGCTTTGCC 
GACGCGCTCG TCGCGGGTCT CGTCCCGCGT TATCGCGAGG ATCGCTTCGG CCTCGCCCGG
CTGACGCTTC TGCTGCCCAG CCAACGTGCC GTGCGCACGG TGACCGAGGC ATTCGTGCGT
GCCAGCGGGG CTGGCCTGCT GCTGCCGCGC ATGACCGTCG TCGGCGACCT CGATCTTGAC
GAGACGCTGG GGCCGCTGCT CGATCCCATA GGGGCGGGCG TGGACATGCC CGAGGCTGTC
GACCCGGTTT GGCGGCTCCT GCGCATAGCG GGCATCCTGC GCGACGAACT GGGCGAGGAC
GCGCCGGGCG AGGCGGCGCT GCTGCGGCAG GCGCGCGGGA TCGCGCAAGG GATCGACCGC
CTGCTCGTCG AGGGCGTGCA GCCCGAACGC ATGCTCGACG AGGCGGTGAT CGGGATTGCC
GCCGAGCTTT CGGAGCATTG GCAGGAAAGC ACCCGTCTGT TTGCGCGGGT GTTCTTCCGC
TGGCGCGCGG AGCTGGAAGC GATTGGCAAG GTCGATGCGC CAGAGCGGCG CAACCGCCTG
CTCGACCACG CCGCGCGAAG CTGGCGCGAG AGGCCGCCGG CGCATCCGGT CATCGCGGCC
GGCGTGACTT CCGCCTCGCC AGTGGTGGCA AGGCTGTTGC GCACGGTTGC CGACATGCCC
GAAGGAGGCG TCGTCCTCCC CGATCTCGAC CTCGCGCTCG ACCCGGAGGT CTGGGACGCG
CTGGGGAGCG CGGGTGGGCC TGACGGGGGC CTGTTCGAGC GTGGCGACGT GGTGACGCAC
CCGCAGTACC ACCTGAAGCT CCTGCTCAAC CGGATGGGCA TCGCACGCGA CGAAGTGCAG
CCCTGGCACC GCGCTGGTCT TGCCGCCGCC CCGCCCGAAC GAAGCCGCGC GATCTCCAAT
CTCTTCCTGC CGCCGGAAGC CAGCGCCGCC TGGGTCTCGC TTGAGGCGCG CGAGCGCCGT
CTGGCCGGCG TCAGGGTCAT GGAAACCGCG CATCCGGAAG AGGAAGCGCA GGCCATCGCC
GTCCTTGTCC GCGAGGCGCT GAAAGAACCG GAGCGCCGCG TTGCGGTGAT AACCCCCGAC
CGCAGCCTTG CCGCGCGCAT CGTGGCGCAT CTTGGCCGGT GGAACATCGG CGCCGACGAC
ACCGCCGGCC GTCCTCTGCC GCAGACGGCG GCGGGAAGGC TGCTGCTGCA ATTGGCCGAG
GTCGTGGCCG AACGCGCGGC ACCGGTGCCG CTGCTCGCGC TGCTCGGCCA CCCGCTCGTG
CAGGGCGGGG AAGGGCGTCC GGTGTGGCTG GAGCGCGTGC GCCAGCTCGA TCTGGTCCTG
CGCGGGCCGC GCCCTGGTCC GGGCCTGCCG GCAATCCGGC AGGCGGTGGA TAAAACGGCG
AAACGCTTCC CGGCGCTGCC CGACTGGTGG TCGGGCGTCG AGGACCTGCT TTTCCCGCTC
GTCCCGCTCG AAGGCGCGGT CCCTCTCGAC ATGGCCCTCG TCGCGCTGGT CGAGGCAGGC
GAGGCGCTTT GCGGAACTGC CCTGTGGGCG CAGGCAGACG GGCGCAGCCT TGCCGCCTTC
GTCGAACGCT GGCGCGATGC AGCAGGCGAT GCGCCGGCCA TGGTCGATGT CGCGGAACTG
CCCTCGTTGC TGCGCGATGC GATGGAGGAG ATCTCTGTCC GCCCACCTTG GGGCGGCCAC
CCGCGCCTCG CGATCTACGG CCTGCTCGAA GCGCGAATGA GCCGTGCGGA CCTCGTCATC
TGCGGTGGGC TGACCGAAGG CACCTGGCCG GGCAGCCCCG CGCCCGATCC GTTGCTGGCC
CCGGCGATCC TGCGCGCGCT CGGCATTCCC GGCGCGGAGT TCCGCATCGG CCTGTCGGCT
CATGACCTTG CCGCTGCACT GGGCGCACCC GAAGTGGTCC TGAGCCACGC CCGGCGCGAC
GCGAGCGGCC CGGTGATCCC CTCTCGCTTC CTGCTGCGGA TCCACGCGAT GCTGGGCGAC
CAGTTGCGCA TCGAGGAGCG CGCGGTGGAG CTTGCCAGGG CGCTTGCGGA CGCTGACCGC
ATCGCCCCGC ATCCGCAACC GCGCCCGATG CCTTCGGCAG AGCAGCGTCG CGTCCCCATA
GCCGTCACCG CGCTCGATCG GCTGCGCGGC GATCCCTACC AGTTCTATGC CTCGGCGATC
CTTGGCCTGA GGAGCCTCGA TCCGATCGAT GCCGATCCGA CGCCCGCCTG GAAGGGCACG
GCGGTCCATG ACGTGCTCAA GGCATGGCAC GAGTCCGGCG GCGTCCCGGG CCAGCTCGTT
CCACTGGCCG AGCGCATGTT CGACGAGATG AGCGCGCACC CGTTCATGCG CACCATGTGG
AAGCCGCGCC TTGTGGACGC GCTGCACTGG ATCGAGGAGG AGACGGATCG GCTTGCCGGG
GAAGGCCGCG AAGTCCTCGC CGTGGAACGC AAGGGCGAGA TCGTGGTCGA CGGCATCCGC
ATCCACGGTC GCGCGGACCG TATCGACCGG CTGCCCGACG GTACGCTCGC GGTGGTCGAC
TACAAGACGG GAAAACCGCC TTCGGGCAAG ATGGTGGCCG AGGGCTTCGC CTTGCAGCTC
GGCCTGATCG GCCTGATCGC ACGCGGCGGC GGCATGGACG GTGTGGCGGG AGAGCCCACG
GCGTTCGAAT ACTGGTCGCT TGGCCGCAAC AAGGAACGCG GCTTCGGCTA CATGAAGTCT
CCGGTGAAGG AGACGGCGCG CCAGACCGGC ATCCCGAGGG AAGAGTTTCT CGACCGCACG
GAGGACTACC TGCACGAAGC CATAGCGCGC TGGTTGCTTG GATCGGAACC CTTCACCGCA
AGGCTCAATC CCGACCTTCC GGGCTATTCT GACTACGACC AGCTCATGCG CCTCGATGAA
TGGCAGGGCC GTGAGCGCAA GGGAGGCGGC GGCGAGCCAT GA
 
Protein sequence
MDDARARPRV WSIAAHRGFA DALVAGLVPR YREDRFGLAR LTLLLPSQRA VRTVTEAFVR 
ASGAGLLLPR MTVVGDLDLD ETLGPLLDPI GAGVDMPEAV DPVWRLLRIA GILRDELGED
APGEAALLRQ ARGIAQGIDR LLVEGVQPER MLDEAVIGIA AELSEHWQES TRLFARVFFR
WRAELEAIGK VDAPERRNRL LDHAARSWRE RPPAHPVIAA GVTSASPVVA RLLRTVADMP
EGGVVLPDLD LALDPEVWDA LGSAGGPDGG LFERGDVVTH PQYHLKLLLN RMGIARDEVQ
PWHRAGLAAA PPERSRAISN LFLPPEASAA WVSLEARERR LAGVRVMETA HPEEEAQAIA
VLVREALKEP ERRVAVITPD RSLAARIVAH LGRWNIGADD TAGRPLPQTA AGRLLLQLAE
VVAERAAPVP LLALLGHPLV QGGEGRPVWL ERVRQLDLVL RGPRPGPGLP AIRQAVDKTA
KRFPALPDWW SGVEDLLFPL VPLEGAVPLD MALVALVEAG EALCGTALWA QADGRSLAAF
VERWRDAAGD APAMVDVAEL PSLLRDAMEE ISVRPPWGGH PRLAIYGLLE ARMSRADLVI
CGGLTEGTWP GSPAPDPLLA PAILRALGIP GAEFRIGLSA HDLAAALGAP EVVLSHARRD
ASGPVIPSRF LLRIHAMLGD QLRIEERAVE LARALADADR IAPHPQPRPM PSAEQRRVPI
AVTALDRLRG DPYQFYASAI LGLRSLDPID ADPTPAWKGT AVHDVLKAWH ESGGVPGQLV
PLAERMFDEM SAHPFMRTMW KPRLVDALHW IEEETDRLAG EGREVLAVER KGEIVVDGIR
IHGRADRIDR LPDGTLAVVD YKTGKPPSGK MVAEGFALQL GLIGLIARGG GMDGVAGEPT
AFEYWSLGRN KERGFGYMKS PVKETARQTG IPREEFLDRT EDYLHEAIAR WLLGSEPFTA
RLNPDLPGYS DYDQLMRLDE WQGRERKGGG GEP