Gene Sare_3463 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3463 
Symbol 
ID5708065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3994409 
End bp3995647 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content74% 
IMG OID641272890 
Producttype I phosphodiesterase/nucleotide pyrophosphatase 
Protein accessionYP_001538256 
Protein GI159039003 
COG category[R] General function prediction only 
COG ID[COG1524] Uncharacterized proteins of the AP superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0116181 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGCC CGCGTGAGCC CACCGGGCCG CCCGACGCCG CGTCGGCGCG GGCAGGCACC 
GGTTCGAGCC CCGCCGCTGA CCTCGGGACG AGTGATCCGT TCGGAACTCT CGCCCCCCGG
TACGGCGAGG CAAGCCTCGC CGACATCCTG CCCGCCGCGC TCGCGGCGTT GGGAGTGCCC
GGTGCGACCG ACGTGCTCGG CCTGACCGCC GCGCTCGATG GTGTGCGCCG GATCGCCGTG
CTGCTCGTAG ACGGGCTTGG CTGGCATCAG ATCCCCACCG CCGCGCCGTA CGCGCCGACT
CTGGTCGGGC TGGCCACGAC CGCCGGTCGC CCGCTCACCT CCGGCTTTCC GTCTACCACT
CCGACCAGCC TGGTGACCCT GGGCACTGGC ACCGTCCCCG GCGCGCACGG TGTGCTCGGG
TTCACCCTGC GGGTCCCCAA CACCGGCCGG GTGCTCAACC ACATCGACTG GGCCGGCGAT
CCGGAGCCGC TGCACTGGCA GCCCGTCCCC ACCCAGCTGG AACGGGCCCG TGCCGCCGGT
GTGCAGGTGA CGGTGGTGAG CCGGCCCGAG TTCGGCGGCA GCGGGCTCAC CCTGGCTGCC
AACCGGGGCG GCGACTACCG CGGCGCGGCG GGCGTTGACG AGCTGGGTCG GGAGATGTTG
GCCGCGCTGA CCGCCGGAAC CGGGCCCACC CTGGTCTCCG GCTACCACCC CGACCTCGAC
CGGCATGGCC ACCTCAGCGG GGTCGATTCG GCGCCCTGGC GGGCCGCCGC GGCCGACGTC
GACCGGCTGC TGACCCGTCT CGTGGAGGGG CTGCCACCGG ACGCCGCGTT GCTGGTCACC
GCCGACCACG GTCAGCTCGA CATACCAGCC GAGCACCGGT TCGACCTGGA CACCGATCCG
CGCCTGCGTG CTGGCGTGAC CGTGGTGGCC GGCGAGCCCC GGGTCCGCTA CCTGCACGTA
CGGCCCGGTG CCCGAGACGA CGTGGTGGCC ACCTGGTCGG CAGTACTCGG CGCCGCGGCC
CGGGTCACGA CCCGGGCGGA GCTGGTGGCA GCCGGCTGGT TCGGTCCGGT GCCCGAGGAA
CACCTGGGCC GGATCGGGGA CGTGGTGGTG GTCTGCAACG GCAGCTACGC CGTGCTGGCC
ACTCGTTCGG AGCCGCCGAT AGCGAGTCGA CTGGTGGCGT ACCACGGCTC GGACACCGCG
GCGGAGATGA CGATTCCATT GCTGGTGGTC CGAGGCTGA
 
Protein sequence
MTGPREPTGP PDAASARAGT GSSPAADLGT SDPFGTLAPR YGEASLADIL PAALAALGVP 
GATDVLGLTA ALDGVRRIAV LLVDGLGWHQ IPTAAPYAPT LVGLATTAGR PLTSGFPSTT
PTSLVTLGTG TVPGAHGVLG FTLRVPNTGR VLNHIDWAGD PEPLHWQPVP TQLERARAAG
VQVTVVSRPE FGGSGLTLAA NRGGDYRGAA GVDELGREML AALTAGTGPT LVSGYHPDLD
RHGHLSGVDS APWRAAAADV DRLLTRLVEG LPPDAALLVT ADHGQLDIPA EHRFDLDTDP
RLRAGVTVVA GEPRVRYLHV RPGARDDVVA TWSAVLGAAA RVTTRAELVA AGWFGPVPEE
HLGRIGDVVV VCNGSYAVLA TRSEPPIASR LVAYHGSDTA AEMTIPLLVV RG