Gene Sare_4172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4172 
Symbol 
ID5703960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4737893 
End bp4739419 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content73% 
IMG OID641273599 
Productadenylylsulfate kinase 
Protein accessionYP_001538952 
Protein GI159039699 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0529] Adenylylsulfate kinase and related kinases
[COG2046] ATP sulfurylase (sulfate adenylyltransferase) 
TIGRFAM ID[TIGR00455] adenylylsulfate kinase (apsK) 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0025253 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTGACG GTTGGGTGCC GCCCGACGGG GTGCTGCGGG ACGCGCCGGC GTACTCGCCG 
CGCCCGGGTG AGCTGGCCGA CCTGGAGCTG CTGCTGACCG GGGCGTACGC CCCGTTGACC
GGCTTCATGA CCCGCGCCGA CCTGGTCTCG GTGAGCCGGC GGGGCCGTCT CGCCGACGGC
GTTCCCTGGC CGGTGGCGGT GACCCTCCAG GTGCCGGCAG CGCTCGCGGA GGGCCTCAAC
GTACGGGATC CGGCCCGACG CACGTTGGTG TTGACCGACG GCGAGGGGGC CCCGATGGCC
GCGCTGGAGG TAGCCGACAC CTGGTCGGTT CGCCCCGGGG TGGCGGGGCT GGGCGGTTCG
GTGCGCCGAC TCGGCGACGG TGGACACGGG CCGTTCCAAC GGCTGCGCCG CACCCCCGAG
GAGGTTCGCT CACTGCTGCC AGCCGGCCGG GTGCTCGGGG TCGTCGCCGA CCGCCCGCTG
CACCGGCCAC AGCTCGCCCA GATCGCGCAC GCGACCCGCA CCCTCGGCGC ACACCTCCTG
GTGCTGATCC CGGTGGGTGA GGACGGCGGC GAGCTGCCGC CGGAGGCCCT GGTGCGCAGC
GTCTTTGCCG CCCGGGACCG AATGCCACCG GCGACGCTGG TCGCCATTCC GCTGCCCCGA
CGCCGGGAGG AGATCAGCGA CGCGCTGCTG CGGGCGCGGG TGTCGGCGGC GTACGGGGTG
ACCCACCTGC TCTCCACCGG CGAGATGCTC TCCGGGGCCG GGCTGCGGGT GCTGGTCCCC
CGCGAGTTGG CGTACGACAA CCGCGACGGG CAGTGGCGAT GGCGGGAGGA CATTCCGCCG
CGCAACCGGC GGCAGGCACT GAGCCAGGCC GAGATCGACG ACCTGCTGGA CCGGGGCTTC
CCCCTGCCGG AGTGGCACAC TCCCCCGGCC GTGGCGAAGG AGTTGGCCCA CGCCCGTCCA
CCCCGCCGGC ACCGAGGCTT GGTGATCTTC CTGACCGGCC TTTCCGGCTC CGGCAAGTCC
ACCATCGCCC GAGGGCTGGC TGACGCCCTG CGGGAGCAGG GTGAGCGAAC GATCACCCTG
CTCGATGGCG ACGTCATACG CCGAGAACTA TCGGCGGGGC TCGGTTTCAG CAAGGCCGAT
CGGGACGCGA ACGTCCGCCG GATCAGCTGG GTGGCCGCCG AGATCGGCCG GCATCGTGGG
GTGGCGATCT GCTGCCCGAT CGCCCCGTAT GCCCGGGCGC GGGCCACCGC CCGGGAGATG
GCGCTGGCCA CGGGGGCGGG CTTCCTGCTG ATCCATGTCG CCACCCCGCT GGAGGTCTGC
GAGCGGCGGG ACCGTAAGGG TCTGTACGCG CGTGCCCGCG CTGGCCTGTT GACCGGGATG
ACCGGCGTGG ACGATCCGTA CGAGGAGCCG ACCGACGCCG ACCTTGTGGT CGACACGACC
GACCTGTCGG TCGACGAGGC GGTGCAGCTG GTCCTGCACC AACTGGGTGA GACGGGCTGG
GTCGAGCCCC GCCATCCGAC CCTCTAG
 
Protein sequence
MSDGWVPPDG VLRDAPAYSP RPGELADLEL LLTGAYAPLT GFMTRADLVS VSRRGRLADG 
VPWPVAVTLQ VPAALAEGLN VRDPARRTLV LTDGEGAPMA ALEVADTWSV RPGVAGLGGS
VRRLGDGGHG PFQRLRRTPE EVRSLLPAGR VLGVVADRPL HRPQLAQIAH ATRTLGAHLL
VLIPVGEDGG ELPPEALVRS VFAARDRMPP ATLVAIPLPR RREEISDALL RARVSAAYGV
THLLSTGEML SGAGLRVLVP RELAYDNRDG QWRWREDIPP RNRRQALSQA EIDDLLDRGF
PLPEWHTPPA VAKELAHARP PRRHRGLVIF LTGLSGSGKS TIARGLADAL REQGERTITL
LDGDVIRREL SAGLGFSKAD RDANVRRISW VAAEIGRHRG VAICCPIAPY ARARATAREM
ALATGAGFLL IHVATPLEVC ERRDRKGLYA RARAGLLTGM TGVDDPYEEP TDADLVVDTT
DLSVDEAVQL VLHQLGETGW VEPRHPTL