Gene Saro_3983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3983 
Symbol 
ID5077513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009426 
Strand
Start bp149792 
End bp152833 
Gene Length3042 bp 
Protein Length1013 aa 
Translation table11 
GC content67% 
IMG OID640481089 
Producthypothetical protein 
Protein accessionYP_001165751 
Protein GI146275590 
COG category[L] Replication, recombination and repair 
COG ID[COG0507] ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member 
TIGRFAM ID[TIGR02686] conjugative relaxase domain, TrwC/TraI family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACTCGA TCGCCTCCGT CCGGTCGTCG AGCGGCGCTG CCGACTACTT CGCCAACGAC 
AACTACTACT CGGCCGATGA GCACGCGGAA GCGGGGGTCT GGGGCGGCGA AGGTGCGCGC
GCGCTGGGGC TTGAGGGGCA GGTCGAGCGC GATGCGTTCG AGGGCGTGCT CAATGGGCGG
CTGCCCGACG GCGAGATGGT CGGGCAGGTC GAAGGTCGGC GCCTGGGTCT CGATCTCACC
TTCTCGATGC CCAAGTCTGC CTCGATCCTG GCGCTGGTCA GCGGCGACCG GCGGATCATC
GATGCGCACC TGGCGGCGGT CAAATCGACC ATGTCGCAGC TTGTCGAAAA GCAGTTCGCC
GAGAGCCGCA ACTATGAGCG CAGCCGCAGC GGCGAACCCC AGAAGACCGG TAACCTCGTC
TATGCCCTGT TCGCCCACGA TACGAGCCGC GCGCTCGATC CGCAGGGGCA TATCCACGCC
GTCGTCGCCA ATCTGACTCG CGATCCCAAG GGCACCTGGA AGGCGCTGTG GAACGGCGAG
ATCTGGAAGA ACAACACCAC GATCGGCCAG TTCTACCACG CCGCCTTCCG CGCCCAGTTG
CAGAAGCTCG GCTACGAAAC CGAGGCTGCC GGCAAACACG GGTCGTTCGA GATCAAGGGC
GTGCCCGCCG AGGTGATCAA GGCCTTCTCG ACCCGCACCA CCGAGATCGA GGCGAAGATC
GCCGAGGTCG GGGCAACCCG CCTCGAAACC AAGAAGCAGA TCACGCTTTA TACCCGCGAT
CCCAAGCTGG CGGTCGAGGA TCGCGGCGCA CTGGTCGAGG GCTGGCAGCA GCGCGCGGCC
GAGCTCGGGT TCGACGGCAA GGCTCTTGTT GCCGCAGCCA AGGCGCGCGC CGAAGTCGAG
GCGCGGCCGA GCTTCCGCGA GACGGCCACG GCCGCGATCG GCGAAGTTTC GACCCGGATC
AACGCCGCGC TGCGCACCCC AAGTCCGCTT GCCGTGAGCG GGGCCGCAGC ATTGTTCCTG
TCGGCCGCCA CCATCAAGGC TCAGCACGCC ACCGCTTCGG CGATCCGTCA TCTCTCCGAG
CGCGAGGCAG CGTTCTCGCC GCAGGCAATC CTGGCGAGCG CGCTGGGGTT CCATATCAAG
GGCCTCGAGG GCGGCGCAGT GGTCCAGCGA ATTGGCGAGC TCATTCGCGA CGGGCACCTG
ATCCCCGGCA AGTCGGACCG GCTCGACGGG CACTCCGATC TCGTGACCAC GCCGGCCGCA
CTGGCCATGG AGCAACGCAT CCTTGACACC ATCGAGCGGG GTCATGGCGA GGGCCGCGCC
TTCATGCCGC CCGAGGCGGC GATGACCCGG CTGCAGGAGG CCGCGCGTGA ACTTGGGCGC
GAGCGTGCCG GGGTGGACAC CTGGCAACTC AACGAAGGCC AACTCGCGGC GGGCGTGGCG
ATCCTCTCAG GCGGCGATCG CTTCCTCAAC GTCCAGGGCG TCGCCGGTGC GGGCAAATCC
ACCCTGCTAG GCGCACTCGA CAAGGTGCTG GACGCGGAAG GCGTGAAGCT TGTCGGGCTC
GCTTTCCAGA ACAAGATGGT CGCTGATCTG CGCGGTGGCG GCGGTCAGGG CATGTCGGCC
GACCAGATGC GCGAGGCGGG GATCGAAGCC TATACCATCG CCCGGTTCCT CAATACCTAC
AGCTCGGCGG CCGCTGCAGG CAGCGGCGAG CGCTACGAGG CAGCGAAGGC TGCGCTCGCC
AATACCGTCA TCATCACCGA CGAAAGCTCG ATGGTTTCCT CGCGCGACAT GCTGCGCCTG
GTGACGCTCG CCGAGCAGCT CGACCTCGCC AAGGCGCCGT TCATGGGCGA CCGCCAGCAA
TTGTCGGCGA TCGAGCAGGG CAAGATGTTC GCGGTCTCGC AGGCCTCTGG TCAGGCGACC
GTGCGGATGG ATGAGAACAT ACGCCAGAAG AACTCGCCGC TTCTCCTCGC AGTTGCGGGG
CTCTCGAACG AGGGCCATGC CGGCCTCGCA CTTGACCTGC TTGCCGCGCA CGGCCGCGTA
ATCGAAGCAG GGCCCGACCA TGTCGCCCGC GCCGCCGAGC TGTGGCTCTC GCTCGAACCC
GACAAGCGCG AAGCGACCGC GATCTTCACC GCCGGGCGTG ACGATCGCGC CGAGATCAAC
GCTCGGGTCC AGGCGGGGTT GCTCAAGGAA GGGACATTGA GCGGCGCGGG CGTGCCGCTG
GCGACCCTCC AGAGCGTCAA CGCCACCCGC GAGGAGCTGC GGTTCGCCTC GACCTACCGG
GTGGGTCAGG TGCTTGAGGC GCGGATGGAG GTGCGCGAGA TCGGCCTCAA GGCCGGCGAA
TACCGGGTGA GCGAGGTGCG CAAGGACGGC AAGGTCGTGC TCGAGCGCGA TGGCAAGCGC
AAGGTCATCG ATCCTGACCG GATCAATCCC GATCACCGCT TCGACCGGCT CGGGCTTCAT
GAAGAGAAGC AGATCCGCCT CCACGACGGC GAAACCGTGT TTTGGCGTGA CAAGGACGGG
TCCCGAGACA TTGCCAAGTC GACCTACGCC ACCGTGCTCG AGGCGCGAGC AGAAGGCATC
CGGGTCGAAC TTGCGGACAA GCGGCAGCTT GTCCTGGTCC CCGGCGATCC GATGCTGCGC
CGTCTCGACC TTGGCTATGC GCTCAACGCT CACATGGCGC AGGGCATGAC CAAGCCACAG
GCGATCGAGG TCATCTCTTC GACCCAGCGC AACCTTGCCA CCCAGCGCAC GCAGAACGTC
CTCAATACCC GCGCCACTGA CGACATGACC GTCGTCACCA ACAACCTCGA AGGCCTGAAA
CAGCAACTCG ACCGCACGCC CGGCAACAAG ACCTCAGCGC TCGAAGTCAC CGGTAAGGTC
GAGGTTGAGC CGCGCCATGT GAACCCGATC GATACCCGGC AGGTGCCCGA ACTCCGCATG
AGCCCCGAGC TCAAGGCCAA GCTCGACGCA GTGCTCGGCA AGGTGGCGGA GGCGCCGGTG
AAGCAGATGC GCCCGGAAAA GACGCTGGGG CTCGACCTGT GA
 
Protein sequence
MHSIASVRSS SGAADYFAND NYYSADEHAE AGVWGGEGAR ALGLEGQVER DAFEGVLNGR 
LPDGEMVGQV EGRRLGLDLT FSMPKSASIL ALVSGDRRII DAHLAAVKST MSQLVEKQFA
ESRNYERSRS GEPQKTGNLV YALFAHDTSR ALDPQGHIHA VVANLTRDPK GTWKALWNGE
IWKNNTTIGQ FYHAAFRAQL QKLGYETEAA GKHGSFEIKG VPAEVIKAFS TRTTEIEAKI
AEVGATRLET KKQITLYTRD PKLAVEDRGA LVEGWQQRAA ELGFDGKALV AAAKARAEVE
ARPSFRETAT AAIGEVSTRI NAALRTPSPL AVSGAAALFL SAATIKAQHA TASAIRHLSE
REAAFSPQAI LASALGFHIK GLEGGAVVQR IGELIRDGHL IPGKSDRLDG HSDLVTTPAA
LAMEQRILDT IERGHGEGRA FMPPEAAMTR LQEAARELGR ERAGVDTWQL NEGQLAAGVA
ILSGGDRFLN VQGVAGAGKS TLLGALDKVL DAEGVKLVGL AFQNKMVADL RGGGGQGMSA
DQMREAGIEA YTIARFLNTY SSAAAAGSGE RYEAAKAALA NTVIITDESS MVSSRDMLRL
VTLAEQLDLA KAPFMGDRQQ LSAIEQGKMF AVSQASGQAT VRMDENIRQK NSPLLLAVAG
LSNEGHAGLA LDLLAAHGRV IEAGPDHVAR AAELWLSLEP DKREATAIFT AGRDDRAEIN
ARVQAGLLKE GTLSGAGVPL ATLQSVNATR EELRFASTYR VGQVLEARME VREIGLKAGE
YRVSEVRKDG KVVLERDGKR KVIDPDRINP DHRFDRLGLH EEKQIRLHDG ETVFWRDKDG
SRDIAKSTYA TVLEARAEGI RVELADKRQL VLVPGDPMLR RLDLGYALNA HMAQGMTKPQ
AIEVISSTQR NLATQRTQNV LNTRATDDMT VVTNNLEGLK QQLDRTPGNK TSALEVTGKV
EVEPRHVNPI DTRQVPELRM SPELKAKLDA VLGKVAEAPV KQMRPEKTLG LDL