Gene Saro_1965 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1965 
Symbolpgk 
ID3917281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2083063 
End bp2084250 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content64% 
IMG OID640444713 
Productphosphoglycerate kinase 
Protein accessionYP_497239 
Protein GI87199982 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0126] 3-phosphoglycerate kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.159546 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTTCA AGACCCTTGA CGACATCGGC GACCTGACCG GCAAGACCGT GCTGGTGCGC 
GAGGACCTCA ACGTGCCGAT GCAGGACGGC GCGGTCACCG ACGACACGCG TCTGCGCGCC
ACCATCGCGA CGCTGAACGA GCTGTCCGAC AAGGGCGCGA AGGTGCTCGT GCTGGCGCAC
TTTGGCCGCC CCAAGGGCCA GCCGTCGGAA GAATTTTCTT TGAAGAAGCT CGCTGCCCCG
CTCGCGCACG TACTGGGCCG TCCGGTCAGC TACATCGACT GGGAAAGCGA CAAGGCCGCT
GTGGCTGCTC TGACGCCCGG TGCGATTGCC GTGCTTGAGA ACACCCGCTT CTTCGACGGC
GAGGAAAAGA ACGACCCAGC CGTGATCGAG CGTTTCGCCA GCCTCGGCGA CATTTACGTC
AATGATGCCT TTTCCGCCGC CCACCGCGCC CACGCTTCGA CCGAAGGCCT GGCACACGTG
CTGCCGGCCT ATGCAGGCCG CGCCATGGAG GCCGAGCTCA AGGCATTGCA GAAGGCGTTG
GGGGAACCCG AACGTCCGGT GGCAGCCGTT GTTGGCGGGG CCAAGGTGTC GACCAAGCTC
GACGTGCTCA AGCACCTTGT CAGCAAGGTC GATCACCTGA TCATCGGTGG TGGCATGGCC
AACACGTTCC TTGCGGCGCG CGGCGTGAAC GTGGGCAAGT CGCTGTGCGA ACACGACCTT
ACCGGCACCG CCGAGGAAAT TCTCGACAAT GCCGACAAGT CGGGCTGCAC CGTTCACCTG
CCGTACGACG TGGTCGTTTC GAAGGAGTTC ACCGCAAACC CGCCGAGCCT GCGGACCTGC
AATGTTCATG AGGTCGCTGC AGACGAGATG ATCCTCGACG TGGGCCCGGC CGCGGTCGAG
GCGCTTGCTG ATGTGCTCAA GACCTGCAAG ACGCTGGTGT GGAACGGTCC GATGGGGGCG
TTCGAGACCG AGCCGTTCGA CGCCGCCACC GTGGCGTTGG CGCGCACGGC TGCAGCTCTG
ACCAAGGAAG GTTCACTCGT GTCGGTGGCG GGCGGGGGCG ATACCGTGGC TGCCCTGAAC
CATGCGGGCG TGGTTGGTGA TTTCTCTTAC ATCTCGACTG CAGGCGGCGC CTTCCTTGAG
TGGATGGAAG GAAAGGAATT GCCCGGCGTC GCGGCGCTGG AAGGATAG
 
Protein sequence
MSFKTLDDIG DLTGKTVLVR EDLNVPMQDG AVTDDTRLRA TIATLNELSD KGAKVLVLAH 
FGRPKGQPSE EFSLKKLAAP LAHVLGRPVS YIDWESDKAA VAALTPGAIA VLENTRFFDG
EEKNDPAVIE RFASLGDIYV NDAFSAAHRA HASTEGLAHV LPAYAGRAME AELKALQKAL
GEPERPVAAV VGGAKVSTKL DVLKHLVSKV DHLIIGGGMA NTFLAARGVN VGKSLCEHDL
TGTAEEILDN ADKSGCTVHL PYDVVVSKEF TANPPSLRTC NVHEVAADEM ILDVGPAAVE
ALADVLKTCK TLVWNGPMGA FETEPFDAAT VALARTAAAL TKEGSLVSVA GGGDTVAALN
HAGVVGDFSY ISTAGGAFLE WMEGKELPGV AALEG