Gene Sare_1531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1531 
Symbol 
ID5708322 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1764334 
End bp1766214 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content77% 
IMG OID641271042 
Productheavy metal translocating P-type ATPase 
Protein accessionYP_001536418 
Protein GI159037165 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2217] Cation transport ATPase 
TIGRFAM ID[TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC
[TIGR01512] heavy metal-(Cd/Co/Hg/Pb/Zn)-translocating P-type ATPase
[TIGR01525] heavy metal translocating P-type ATPase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000363104 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAGGCCG AGTCGGAGTG CGTGCCCGAG CGGCCGGCGC GATCCGAGCC GGGCCGCCGG 
CTGCTCTGGG CGCCACTGGG CGGACTGACC GTGCTGGTGC TCGCCGGCGC GGCGGCACGG
CTCGCCGGCC GGCCCGGCCT CGGTGACGCG CTCTGGGCCG CCGCCACCGT GGCCGCGCTG
GTGCCGGCCG CCGCCTCGAT GCTGCGCGAG CTCTGGCACC GCCGGTACGG CGTGGACGTC
ATCGCCGTCC TCGCGCTGGC CGGTGCGCTC GTCGTGCGGG AGTACCTGGC CGGGGCGGTG
ATCGCGGTGA TGGTCGCCAC CGGCCGGACG CTCGAGGCGT ACGCCCAGGG TCGGGCGACC
CGCGACCTGC GTGCGCTGCT CGCCCACGCC CCGCGGACCG CACGCCGACG CGCGGCGGAC
GGCACGATCG AGGTGGTCCC GGCCGACCAG GTCGCTGCGG GGGACCAGCT GTTGGTCGGC
CCCGGCGACG TGGTGCCGGT GGACGGGCAG CTCGACGCGG CGGCCACCCT CGACGAGTCG
GTGGTGACCG GCGAGTCGCA ACTGGCCCAG CGCCCCGCTG GGGACCGGGT GGGCAGCGGT
GTGGTGAACG CCGGTGCCGC GTTCGGCATG CGGGCCACCG CGGAAGCGGC GAACAGCACG
TACGCCGGGA TCGTGCGGTT GGCGCGTGAG GCCACCGCGC ACAAGGCCCC GACGGTGCGG
CTGGCCGACC GGTACGCCGT CGCCTTCGTC CCGTTCACCC TGGCGCTCGC CGGGCTGGGT
TGGCTGGTCT CCGGCGACGT GGTCCGGGCG GTGGCGGTCC TGGTGGTGGC CACGCCCTGC
CCGCTTCTGT TGGCGACCCC CATCGCCATC GTCTCCGGCC TGTCCCGCAC GGCGCGGCAC
GGTGTCCTGG TCCGTAACGG TGGCTCCCTC GAACTGCTCG GCCGGGCCCG CACCCTGCTG
GTGGACAAGA CCGGCACGTT GACCGCCGGC CGGCCTCGGG TGGCCGAGAC GGTGCCGGCG
CCGGGAACCA CGGCGGACGA GGTGCTGCGC CTCGCCGCCT CCGTGGAGCA GCTCTCCCCG
CATGTGCTGG GCCGTGCCCT GGTGGAGGGG GCCCGGGAGC GAGGGATCGC CCTCGCCGAA
CCGGGCGGGG TCACCGAGGA ACCGGGCCGG GGCGTACGGG GGCGGGTGGA CGGCGGTGAG
GTGTGGGTGG GGCAGCTCGA CGGCCCGCCA CCCGAGTGGG CGGAGCCGGC CCGGGACCGT
GCCGAGCGGG CCGGCCACTC CCTGGTGTGG GTCGGTGGTA CCGCCGGGCC GCTCGGCGTG
CTCCTGTTGG CGGACCCGGT CCGGCCCGAC GCGTCGCGGA CCGTCGGCCG GCTGCGGGCG
GCGGGGCTGC GCCGGATCGT CATGGTCACC GGTGACCGCC CGGCCACCGC TGGTCGGGTG
GCCCGTCAGG TCGGGGTCGA CGACGTGGTC GCCCACTGCG CGCCCGCCGA GAAGGCGGAG
CGGGTCCGCG CCGAGGTGGG CCGGGCGGTC ACCGTGATGG TCGGGGACGG GGTGAACGAC
GCCCCCGCCC TGGCCACCGC CCACGTCGGC GTGGCGATGG GCGCCACCGG GGCGACCGCG
TCGGCGGATG TCGCCGACGC GGTCCTCACC GTTGACCGAC TGGAACGCCT GGCCGACGCC
GTGGAGATCG CCCGGTACGC GCGCCGCATC GCGGTGCAGA GCGCCACGGT GGGTATGGGG
CTCGCCGTGC TGGCCATGTT CGTCGCCGCG GTCGGGCGGC TGCCACCGGT GGCCGGTGCC
TTCCTCCAGG AGGGCATCGA CGTTCTGGTG ATCCTCAACG CGCTGCGTGC CCTGTTCGGC
CCGGCCAGCA CGCGACGGTG A
 
Protein sequence
MEAESECVPE RPARSEPGRR LLWAPLGGLT VLVLAGAAAR LAGRPGLGDA LWAAATVAAL 
VPAAASMLRE LWHRRYGVDV IAVLALAGAL VVREYLAGAV IAVMVATGRT LEAYAQGRAT
RDLRALLAHA PRTARRRAAD GTIEVVPADQ VAAGDQLLVG PGDVVPVDGQ LDAAATLDES
VVTGESQLAQ RPAGDRVGSG VVNAGAAFGM RATAEAANST YAGIVRLARE ATAHKAPTVR
LADRYAVAFV PFTLALAGLG WLVSGDVVRA VAVLVVATPC PLLLATPIAI VSGLSRTARH
GVLVRNGGSL ELLGRARTLL VDKTGTLTAG RPRVAETVPA PGTTADEVLR LAASVEQLSP
HVLGRALVEG ARERGIALAE PGGVTEEPGR GVRGRVDGGE VWVGQLDGPP PEWAEPARDR
AERAGHSLVW VGGTAGPLGV LLLADPVRPD ASRTVGRLRA AGLRRIVMVT GDRPATAGRV
ARQVGVDDVV AHCAPAEKAE RVRAEVGRAV TVMVGDGVND APALATAHVG VAMGATGATA
SADVADAVLT VDRLERLADA VEIARYARRI AVQSATVGMG LAVLAMFVAA VGRLPPVAGA
FLQEGIDVLV ILNALRALFG PASTRR