Gene Sare_4073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4073 
Symbol 
ID5705368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4633973 
End bp4635100 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content71% 
IMG OID641273499 
Productacetate kinase 
Protein accessionYP_001538854 
Protein GI159039601 
COG category[C] Energy production and conversion 
COG ID[COG0282] Acetate kinase 
TIGRFAM ID[TIGR00016] acetate kinase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0369038 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCGGG TACTGGTACT CAACTGCGGA TCGTCATCGG TCAAGTGGCG GTGGTATGAC 
GGCGACGAAC TCCTCGACCG GGGCGCCGTC GAGCGAATCG GCGAGTCCGG TGGTGGGCCG
GCCGACCATG GCACGGCGGT CCGGGAGATC CTCACCGGGC TCGACCTGGC CGGGCTCACC
GCCGTCGGAC ACCGGGTGGT GCACGGTGGA CGCCGCTTCG GCGAACCGGT CCTGATCGAC
GACGCGGTGC TCACCGCGAT CCGGGGCCTG ATACCGCTCG CCCCGCTACA CAATCCCGCC
AACCTGGCCG GCATCGAGGT CGCCAGGGCG GCGCTACCCG GCATCCCACA GGTTGCCGTC
TTCGACACCG CCTTCCACAC CACGCTGCCC GAGTCCGCGG CCACCTACGC GATCGACCGT
GCGACGGCGG ACCGGTACGG CATTCGACGG TACGGTTTCC ACGGCACCTC CCACGCGTAC
GTCTCGCGGC GCACGGCGGA GTTGATCGGT CGCCCGTACG CCGAGACCAA CACCATCACC
CTGCACCTGG GCAACGGCGC GAGCGCCGCC GCTGTCGCCG GCGGACGGAG CGTGGCCACC
TCGATGGGCA TGTCCCCGCT CGAAGGACTG GTCATGGGCA CCCGCAGCGG CGACCTGGAC
CCGACGGTGA TCTTCCACCT GCGGCGGGAA GGCGGGTTGA GCGTCGACGA AATCGACGAC
CTGCTGAACC ATCGCAGCGG CCTGTACGGG CTCACCGGCG CCAACGACAT GCGTGAGGTG
CTTACGCGAC GGGCGGACGG CGACCCGGCC GCCGCGCTCG CCTTCGACGT GTACTGCCGC
CGCATCACCG GCTACGTCGG GGCGTACTAC GCGCTGCTCG GCCGGGTGGA CGCGGTGACC
TTCACCGCGG GCGTCGGCGA GCACGCCGCC CCGGTCCGGG CGGCAGCGTT GGCCGGACTG
GAGCGACTTG GTATCACCGT CGATCCGGAA CGTAACGCGG GCCATGGTGA CCGCGTCATC
TCACCCGACG GCGGCGAGGT GGCGGTCTGC GTCATCGGCA CCGACGAGGA ACGGGAGATT
GCCCGCGCCG CCCGCGAGGT GGCGGGCGGG GCTCAGGTCG ACCGGTAG
 
Protein sequence
MSRVLVLNCG SSSVKWRWYD GDELLDRGAV ERIGESGGGP ADHGTAVREI LTGLDLAGLT 
AVGHRVVHGG RRFGEPVLID DAVLTAIRGL IPLAPLHNPA NLAGIEVARA ALPGIPQVAV
FDTAFHTTLP ESAATYAIDR ATADRYGIRR YGFHGTSHAY VSRRTAELIG RPYAETNTIT
LHLGNGASAA AVAGGRSVAT SMGMSPLEGL VMGTRSGDLD PTVIFHLRRE GGLSVDEIDD
LLNHRSGLYG LTGANDMREV LTRRADGDPA AALAFDVYCR RITGYVGAYY ALLGRVDAVT
FTAGVGEHAA PVRAAALAGL ERLGITVDPE RNAGHGDRVI SPDGGEVAVC VIGTDEEREI
ARAAREVAGG AQVDR