Gene Rcas_2070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2070 
Symbolpgk 
ID5539550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2655583 
End bp2656770 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content63% 
IMG OID640894205 
Productphosphoglycerate kinase 
Protein accessionYP_001432174 
Protein GI156742045 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0126] 3-phosphoglycerate kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAAA AGACGATCCG CGACATCGAT TGGAGCGGCA AACGCGCTCT TGTTCGCGTC 
GATTTCAATG TGCCGCTGGA GAACGGGCAG ATCACCGACG ATACCCGCAT CCGTGCAGCA
CTTCCAACGA TCCGCTACCT GCTGGAACAT GGTGCAGCCG TCATTCTTAT GTCGCACCTG
GGGCGACCGA AGAATAAAGT CGTCGAAAGC ATGCGCCTGG CGCCGGTGGT GGCGCGCCTG
GCGGAACTGC TCCCCGAAGC GAAGGCGGTC AAAGGCTCAC AGGCAACCAC CGGTCCTGCG
GCTGAGGCGG CTGCGCGCGA CCTCAAACCG GGCGAGGTGC TGGTGCTCGA AAACACCCGA
TTTGACCCGC GCGAGGAAGC CAACGACGAA AGTATGGCGC GCGAACTGGC AAAACTTGGC
GATGTCTACG TCAACGACGC CTTCGGCTCG GCGCACCGCG CCCATGCCTC GACTGAAGGC
GTGGCGCGAT TCCTCCCCGC TGTCGCCGGC TTCCTGATGG AAGCCGAACT CGCCGCGCTC
CAGGGAGCGC TGGAAAATCC GGCGCGGCCA TTTGTCACTA TCATCGGCGG CGCCAAGATC
AGCGACAAAA TTGGCGTGAT CGAAAACCTG CTCGGCAAAG TCGATGCACT GCTGATCGGC
GGTGGCATGG CAAACACCTT TCTGCTTGCT CAGGGGCACG AGATGGGCGA CTCGCTGGTC
GAGCCGGACT CCGCTCCCAT TGCGAAGTCC TTGCTGGATC AGGCGGCGCA ACGCGGCGTG
CGCCTCATGC TGCCGACCGA CGTCGTAATT GCGGATGCCT TCAGCGCCGA TGCCAACCGC
AAGGTTGTGC CGGTGGGCGA AATTCCGCCG GGCTGGCGCG CGCTGGATAT CGGGCCTGAA
ACGATCCGCG CGTACACCGA GGTCATCACG GGCGCACAAA CCGTCATCTG GAATGGACCG
ATGGGGGTGT TCGAACTGGC GCCATTTGCC GAGGGCACCC GAGCGATCGC ACAGGCAATG
GCGAATTGCC CGGGGATGAC GATTATCGGC GGTGGCGACT CGGTTGCAGC GATAGAACAG
ATGGGACTTG CCGATAAGAT TCGCCACATC TCGACCGGTG GCGGCGCGTC GCTGGAACTG
CTGGAAGGGC GTATCCTGCC AGGCGTCGCA GCGCTGAATG ACGCATAA
 
Protein sequence
MAKKTIRDID WSGKRALVRV DFNVPLENGQ ITDDTRIRAA LPTIRYLLEH GAAVILMSHL 
GRPKNKVVES MRLAPVVARL AELLPEAKAV KGSQATTGPA AEAAARDLKP GEVLVLENTR
FDPREEANDE SMARELAKLG DVYVNDAFGS AHRAHASTEG VARFLPAVAG FLMEAELAAL
QGALENPARP FVTIIGGAKI SDKIGVIENL LGKVDALLIG GGMANTFLLA QGHEMGDSLV
EPDSAPIAKS LLDQAAQRGV RLMLPTDVVI ADAFSADANR KVVPVGEIPP GWRALDIGPE
TIRAYTEVIT GAQTVIWNGP MGVFELAPFA EGTRAIAQAM ANCPGMTIIG GGDSVAAIEQ
MGLADKIRHI STGGGASLEL LEGRILPGVA ALNDA