Gene Sare_4547 details


Gene Information

Locus tag: Sare_4547
Symbol:
ID: 5705809
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5140169
End bp: 5141329
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 72%
IMG OID: 641273959
Product: pyruvate phosphate dikinase PEP/pyruvate-binding
Protein accession: YP_001539306
Protein GI: 159040053
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0574] Phosphoenolpyruvate synthase/pyruvate phosphate dikinase
TIGRFAM ID:
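
The length fields in this record can be cross-checked from the coordinates: the gene length is the inclusive span between Start bp and End bp, and the protein length is the number of codons minus the stop codon. A minimal sketch in Python, where the function names are illustrative assumptions and not part of the source database:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(5140169, 5141329)
    print(length, protein_length_aa(length))
```

With the coordinates listed in this record, the sketch yields 1161 bp and 386 aa, matching the Gene Length and Protein Length fields.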


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.529657
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0228656
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGGAC ACCGAGGGGA GTGGCTGTTG CTGATCGACC TGGCCGACGC CGAAGCCGCG 
ACGTCCGGAG GCAAGGCGGC AGTGCTCGCC CGGCTCCTCG AAGCAGGTCT GCCGGTCCCG
CCTGGTTTTG TGGTGCCGGC CTCCGCCTAC GAACAGGCAG CCGACGGCCC CTCCGCCGAG
CTCGCCGCCG CGATCGCCCA GGCGCTGCCG CGGCTCGGCG ACGGTCATGT CGCCGTACGC
TCCTCGGCGA CCAACGAGGA CACCGCCCAG GCCACCGCCG CAGGGCAGCA CGACACCTTC
CTCGGCGTCC GCGGGCCCGA CGAGGTGGTC GACGCCGTGA GCAGATGTTG GGCCTCGCTG
TGGTCCGAGC GCGCCGTGGA ATACCGGCGC CGGCGGGGAG ACACGGAGTC ACCGACGATC
GCCGTCCTGG TGCAGCGTCT AGTGGACGCG GACGTCGCTG GGGTGATGTT CACCGGCGAT
GACATCCGGC TGGAGGCGTC CTGGGGGTTG GGCGAGAGCG TCGTCAGCGG CCACGTAACA
CCGGACTCCT GGATGGTGTC CGGCGGCGAC ATCACCCATC GGGCGCTCGG CACGAAGAAG
ACCCGGATCG ACCGCACGAT CTGCCGCGAG GTGGAACCGG CCGACCGGGA TCGCTTCTGC
CTCACCGACG ACGAGGTCAC CCGGCTCGCA CAGCTCGGTC GGCAGATAGC CGCTCTGCTG
GGCGGCCCAC AGGACATCGA GTGGGCAATC GCCGATTCCC GGATCTGGAT ACTTCAGTCC
CGCCCGGTGA CCACCGCCCT CCCCGCCACA CCCCCGGCCG CCGCGGCCGC CGAGGGCAAG
GCCCTCACCG GTACGCCCGG AAGCCCGGGC ATCGCCACCG GACCGGCGCG CGTGGTGCGC
GGCCCCGCCG ACTTCGCCCG AGTCCGGCCC GGTGACGTAC TCGTCTGCCG CACCACGGAT
CCGTCGTGGA CCCCGCTGTT CGGCGTGGTC GCCGCCGTCG TCACCGAAGT CGGCGGCCTG
CTCTCGCACG CCGCGATCGT CGCCCGCGAG CAGGGCGTCC CTGCCGTCCT GGCCGTCCCG
GACGCGACGA CAGCCCTGCC CGACGGCGCG CCGGTGGAGG TGGACGGAAA CTCCGGCTCG
GTGGCACGCC GTGGTTCCTA A
 
Protein sequence
MHGHRGEWLL LIDLADAEAA TSGGKAAVLA RLLEAGLPVP PGFVVPASAY EQAADGPSAE 
LAAAIAQALP RLGDGHVAVR SSATNEDTAQ ATAAGQHDTF LGVRGPDEVV DAVSRCWASL
WSERAVEYRR RRGDTESPTI AVLVQRLVDA DVAGVMFTGD DIRLEASWGL GESVVSGHVT
PDSWMVSGGD ITHRALGTKK TRIDRTICRE VEPADRDRFC LTDDEVTRLA QLGRQIAALL
GGPQDIEWAI ADSRIWILQS RPVTTALPAT PPAAAAAEGK ALTGTPGSPG IATGPARVVR
GPADFARVRP GDVLVCRTTD PSWTPLFGVV AAVVTEVGGL LSHAAIVARE QGVPAVLAVP
DATTALPDGA PVEVDGNSGS VARRGS
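
The sequences above are printed in space-separated blocks of ten characters, so they need to be normalized before any length or composition check. A minimal sketch in Python of that cleanup plus a GC-content calculation; the helper names are hypothetical, and the result can be compared against the Gene Length and GC content fields reported in this record:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Only the first block-formatted fragment is used here as a small demo;
    # paste the full gene sequence to check the whole record.
    fragment = "ATGCACGGAC ACCGAGGGGA"
    cleaned = clean_sequence(fragment)
    print(len(cleaned), round(gc_percent(cleaned), 1))
```

Running the same two functions on the full gene sequence gives values to compare with the 1161 bp and 72% listed under Gene Information.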