Gene Sare_3867 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3867 
Symbol 
ID5705898 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4402306 
End bp4403865 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content71% 
IMG OID641273288 
Productalkaline phosphatase 
Protein accessionYP_001538650 
Protein GI159039397 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3540] Phosphodiesterase/alkaline phosphatase D 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.261965 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00291671 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGAGT TCGATCGACG TATGCTGCTG CGTGCCGGCC TGGCGGTGGG TGCGGGAGCC 
GCCGGTGGTG TGCTCCTTGG TGGTGCCGGT GTCAGCGCTG GGCCAGCCGC TCCGGGGTGG
CGCCCGGCCG GCCGTCCTGT TTTGACGCAC GGGGTGCAGA GCGGCGATGT GTCCGCCGAG
TCGGCGGTGG TGTGGACCCG GGCCGACCGG CCTGGCCGGA TGCTCGTGGA GGTGAGCCGC
CGGCCCGACC TGCGGGACGC CCGGCGCCTG CGGGGGCCGG TGCTGGACCC GGCCGGGGAC
CTCACTGGCA AGATGCGCCT GCGGGGCCTG CCGGCCGGCG AGCGCTGGTA CTACCGGGTT
CGCGTGGAGA GCCTGGACCG GCCGGGGCTG TGTAGTGAGC CGCTGACCGG GTCGCTGCGT
ACCGCCCCCA GGGGGCGCAT GCGGCGCGAC ATCCGGTTCG TCTGGACCGG GGACATCGCT
GGACAGGGCT GGGGTATTGC CCCCGATTTC GGCGGTATGT CCATCTTCGC CGCCATGCGC
GCCGCCCGCC CCGACTTCTT CATCTGTAGC GGCGACACGG TGTATGCCGA CAACCCGTTG
ACCGAGACGG TGCCGCTGCC CGATGGGCGG ATCTGGCGGA ACCTCGTCAC CCCGGAGAAG
AGCAAGGTGG CCGAGACCCT GGCGGAGTTC CGGGGGCAGT ACGCGTACAA CCTGCTCGAC
GAGCACCTGC GTGCGTTCGT TGCCGAGGTG CCGCAGGTCA ACCAGTGGGA CGACCACGAG
GTGACGAACA ACTGGTACCC GGGTGAGGTG CTGGCCGACG ACCGGTACAC CGAGAAGCGG
GTCGACGTGC TCGCCGCCCG TGCTCGGCGG GCGTTCGACG AGTGGTTGCC CACCCCGGTC
CGTGGACCCC GCTACCGACG GCTGTCGTAC GGGCCGTTGT TGGACGTCTT CGTGCTGGAC
ATGCGCACAC ACAAGGACCC GAACGACGGG AACACCTACC CTGACCCGAA CCGGGGGTTG
CTCGGCCGGG AGCAGCGGGA GTGGCTGATC CGTGGGCTGA CCCGCTCCCG GGCGACGTGG
AAGGTGATCG CCGCCGACCT GCCACTCGGT TTGGTGGTGC CGGACGGTGC GGCCCAGGAG
GGGGTGGCGC AGGGCGACCC GGGGGCGCCG GCGGGCCGGG AGCTGGAGTT CGCCGAGGTG
CTCACGGCGG CCCATCGGGC CGGGGTGAGC GGCATCGTCT TCCTCACCGC CGACGTTCAC
TACACCGCCG CCCACCACTA CGACCCGGCC CGGGCGGCAA TCGACGACTT CACGCCGTTC
TGGGAGTTCG TCTCCGGTCC GGCGCACGCT GGTGCGTTCG GCCCGAGCCA GCTGGATGGC
ACGTTCGGCC CGAAGGCGGT CTTCGTCAAC GCACCACCTG CCGCGAACAC CAGCCCCGCA
GCCGGTTTCC AGCACTTCGG CGAGGTGCAC ATCGATGCCG GCAGCGGTGC CTGCACCGTC
CATCTGCGCG ACCGCGCCGG CAGATCCCTC TGGACCACCA CCCTTCCCGC TCCGCGCTGA
 
Protein sequence
MTEFDRRMLL RAGLAVGAGA AGGVLLGGAG VSAGPAAPGW RPAGRPVLTH GVQSGDVSAE 
SAVVWTRADR PGRMLVEVSR RPDLRDARRL RGPVLDPAGD LTGKMRLRGL PAGERWYYRV
RVESLDRPGL CSEPLTGSLR TAPRGRMRRD IRFVWTGDIA GQGWGIAPDF GGMSIFAAMR
AARPDFFICS GDTVYADNPL TETVPLPDGR IWRNLVTPEK SKVAETLAEF RGQYAYNLLD
EHLRAFVAEV PQVNQWDDHE VTNNWYPGEV LADDRYTEKR VDVLAARARR AFDEWLPTPV
RGPRYRRLSY GPLLDVFVLD MRTHKDPNDG NTYPDPNRGL LGREQREWLI RGLTRSRATW
KVIAADLPLG LVVPDGAAQE GVAQGDPGAP AGRELEFAEV LTAAHRAGVS GIVFLTADVH
YTAAHHYDPA RAAIDDFTPF WEFVSGPAHA GAFGPSQLDG TFGPKAVFVN APPAANTSPA
AGFQHFGEVH IDAGSGACTV HLRDRAGRSL WTTTLPAPR