Gene Saro_2790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2790 
Symbol 
ID3916950 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3010293 
End bp3011990 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content67% 
IMG OID640445569 
Producttype I phosphodiesterase/nucleotide pyrophosphatase 
Protein accessionYP_498060 
Protein GI87200803 
COG category[R] General function prediction only 
COG ID[COG1524] Uncharacterized proteins of the AP superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0567243 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTCGCG TGAAGACCCT TGCCCGCTTT TCGTCATTCC TGCTTGCAAC CGTTCTTCCG 
CTGGCGCCCG CGATCGCTCA GGAAGCGCCG CCCCCGCCCC CATCGCTGCC GATTGTCTCC
CAGCAGGTGC CGCAGAACGC GCCGCGCCTT GTCGTGGCGA TCTCGATCGA CCAGTTTTCT
TCCGATATCT TCGCACAGTA TCGCCAGCGC TTTACCGGCG GTCTTGCGCG CCTGACCACG
GGCGCGGTCT TCCCCGCCGG CTACCAGAGC CACGCGGCGA CCGAGACCTG CCCGGGCCAC
TCGACGATCC TTACCGGCAA TCGCCCCGCC CATACGGGGA TCATCGCCAA TTCCTGGATC
GACCAGTCGG TCGCCGTCGG GCCCAAGCAG GTCTATTGCG CCGAGGATAC GGGCAAGCGT
GCCGCCGGAT CGAAGGACTA TGTCGCATCG CCCATCCACC TGCTGGTGCC GACGCTGGGC
GAATGGATGA AGCGGGCCAA TCCGGCGGCG CGCAATATCG CGGTATCGGG CAAGGACCGC
GGCGCACTGA TGATGGGCGG GCACGATACC GACCAGGTCT ATTGGTGGAA GGGCAAGGGC
TTCGTCACCC TTGCCGGGCG CGAGCCGGGG CCGACCGCCA TCGCGCAGAA CGTCGAGATC
GCGCGCACGC TCGCCAAGGG CGCGCCGGCC TTCCCGCTGC CCGCTTACTG CGGGCGCAAC
GACCGGGCCG TCACCGCCGG CGAGGTGACG GTGGGGACCA ACCACTTTGC CCTGAAGGCC
GGCGATGCCG ACGGCTTCCG CATATCGCCC CGGCTCGACC GGGCGACGCT CGATCTTGCG
GTCAAGCTGG TGGACGAACA GAAGCTGGGG CGCGGCGCGG TGCCCGACCT TCTGGCGGTG
AGCCTTTCGG CAACCGATTA CGTTGGCCAC GCCACGGGTA CCGAGGGCGC GGAAATGTGC
ATCCAGCTTG CGCAGCTCGA CCTGGCGCTC GGCGATTTTC TTGCCAGCCT CGACAAGCGG
GGCATCGACT ATGCGGTGGT GCTGACGGCC GACCACGGCG GCTTTGACAT TCCCGAACGC
CTCCACGAGC AGGCGCTGGA GAAGAGCGCG CGGGTCGGGC CGGAAGTCTC GGCCGAGGCG
CTGTCGGAGG CGCTGGGCAA GCGTTACGGG CTTGCGCCCA AGGGGCTGAT CCTGGCGGAC
GGGCCGGCGG GCGACTACTG GCTGCGCAAG GACCTTGACG AGGGTCTGCG CGGCAGGATC
GTCGACGACG CGAAGGCGAT CCTGATGACG AGCCCCTATG TCGAGAAGGT GCTTTCCGCC
GCGGAAATCG CGGCAACGCC GATGCCGTCG GGCTCGCCGG AAACCTGGAC CCTGGCCGAA
CGCGCGCGCG CTTCCTACAA CCCGCTTCAC TCGGGCGATT TCGTGGTGAT GCTCAAGCGC
AGCGTCGTGG CGATCCCGAC GCCGCGCGCG GGTTATGTGG CAACCCACGG CAGCCCCTGG
GACTATGACC GCCGCGTGCC GATCATGTTT TGGCGCAAGG GCATGAACGG CTTCGAACAG
CCGAGCGCGG TGGAAACGGT GGACATTGCC CCCAGCCTTG CTGCACTGAT CGGTCTCAAG
ATCCCGCAAG GTGCATTCGA CGGGCGCTGT CTCGACCTCG ATGGTGGCCC GGCCAATACG
TGTGGAGCAG GCAAGTGA
 
Protein sequence
MFRVKTLARF SSFLLATVLP LAPAIAQEAP PPPPSLPIVS QQVPQNAPRL VVAISIDQFS 
SDIFAQYRQR FTGGLARLTT GAVFPAGYQS HAATETCPGH STILTGNRPA HTGIIANSWI
DQSVAVGPKQ VYCAEDTGKR AAGSKDYVAS PIHLLVPTLG EWMKRANPAA RNIAVSGKDR
GALMMGGHDT DQVYWWKGKG FVTLAGREPG PTAIAQNVEI ARTLAKGAPA FPLPAYCGRN
DRAVTAGEVT VGTNHFALKA GDADGFRISP RLDRATLDLA VKLVDEQKLG RGAVPDLLAV
SLSATDYVGH ATGTEGAEMC IQLAQLDLAL GDFLASLDKR GIDYAVVLTA DHGGFDIPER
LHEQALEKSA RVGPEVSAEA LSEALGKRYG LAPKGLILAD GPAGDYWLRK DLDEGLRGRI
VDDAKAILMT SPYVEKVLSA AEIAATPMPS GSPETWTLAE RARASYNPLH SGDFVVMLKR
SVVAIPTPRA GYVATHGSPW DYDRRVPIMF WRKGMNGFEQ PSAVETVDIA PSLAALIGLK
IPQGAFDGRC LDLDGGPANT CGAGK