Gene Nmag_3795 details


Gene Information

Locus tag: Nmag_3795
Symbol:
ID: 8826665
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013923
Strand:
Start bp: 177836
End bp: 179044
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 61%
IMG OID:
Product: Phosphoglycerate kinase
Protein accession: YP_003481898
Protein GI: 289583488
COG category:
COG ID:
TIGRFAM ID:
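
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not say so, but the numbers are consistent with that reading) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the gene record above.
START_BP = 177836
END_BP = 179044
GENE_LENGTH_BP = 1209
PROTEIN_LENGTH_AA = 402


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

The gene is listed as unspliced, so the whole span is coding sequence: 1209 bp is 403 codons, of which the last is the stop codon, leaving 402 residues.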


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCTCCT TTCAGACGCT CGACGACCTC GAACCTGGGC AACGGCTTCT GGTCCGCATC 
GACGTCAACG CTCCCGTCGA GGACGGCGTC GTACAGGACG ACCGCCGATT CGCCCGCCAC
GCAGAGACCG TTCAGGAACT GCTCGCAGAC GACCACGCGA TCGCCTTACT TGCACACCAG
GGACGGCCGG GACGGGACAC GTTCGTTTCC CTCGACCAGC ATGCTGCAAT CCTCTCCGAC
CATCTCGATC GGCCCGTCGA ATTCGTCGCC GATACGTGCG GCGAGGAAGC GCTGTCTGCG
ATCGATGGCC TCGAACGCGG TGACGTGCTC CTCCTCGAAA ACGTGCGAAT GTGTGAGGGA
GAACTACCCG AAGAAGCGCC AGAGACCAAG GCCGAGACGG AACTGGTACG AACGCTTTCG
ACGGAATTCG ACGCGTACGT CAGCGATGCC TACGCGACGG CACATCGATC ACACGCGTCG
ATCGTTGGCT TTCCACTCGT TATGGATGCC TACGCAGGGC GCGTGATGGA ACAGGAGTAC
CGAGCAAACA CCGCGATCCG GGAACGAGCG TTCGACGGCC CCGTGACGAT GATTCTCGGC
GGAACGAAGG CCGAAGACAC CATTCCCGTC GTGGAACAAC TCGCGGATGT CGTCGATCAC
TTCTGTCTGG GCGGTATCAT CGGCGAACTG TTCTTGCGTG CCGACGGACA CGACCTCGGA
TACGATGTCG ACGGGACGGA ACTGTTCGAC CATCAGTGGG AAGCCCACAG CGAGACGATC
ACGGACGCAC TCGAGACGCA CGACACCACG GTGGTGCTCC CGACCGATCT TGCGTACAAA
GACGACGGCG ATCGAGCGGA AACCGCGGTC GAGGGGATCG AGAAGCAGAC ATCGTATCTC
GACATTGGTT CGGAGACGAT CGATCGCTAC ACCGACCGCA TCGCGGACTC CGAGGCAGTC
TACGTCAAAG GTGCCGTCGG TGTCTTCGAG GACGAGCGGT TCGCCAACGG TACCGTCGGG
ATACTTTCAG CGATCGCCGA CACCGACTGT GTGTCGGTTG TCGGTGGCGG TGATACAGCC
CACTCGATCG AACTGTACGA TCTCGACGAG GACGATTTCA CGCGCGTTTC GATCGCTGGC
GGTGCGTACG TCCGCGCTCT GACCGGTGCG TCCCTCGCGG GAATCGACGC ACTCGAGTGT
GACTCTTGA
 
Protein sequence
MTSFQTLDDL EPGQRLLVRI DVNAPVEDGV VQDDRRFARH AETVQELLAD DHAIALLAHQ 
GRPGRDTFVS LDQHAAILSD HLDRPVEFVA DTCGEEALSA IDGLERGDVL LLENVRMCEG
ELPEEAPETK AETELVRTLS TEFDAYVSDA YATAHRSHAS IVGFPLVMDA YAGRVMEQEY
RANTAIRERA FDGPVTMILG GTKAEDTIPV VEQLADVVDH FCLGGIIGEL FLRADGHDLG
YDVDGTELFD HQWEAHSETI TDALETHDTT VVLPTDLAYK DDGDRAETAV EGIEKQTSYL
DIGSETIDRY TDRIADSEAV YVKGAVGVFE DERFANGTVG ILSAIADTDC VSVVGGGDTA
HSIELYDLDE DDFTRVSIAG GAYVRALTGA SLAGIDALEC DS
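
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC content computed, and its translation compared with the listed protein. It assumes the standard codon assignments, which translation table 11 shares with the standard code; table 11 differs in the start codons it permits, which this sketch does not handle (the gene here starts with ATG). All function names are illustrative.

```python
from itertools import product

# NCBI ordering: bases in the order T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First line of the gene sequence as printed in the record.
    first_line = "ATGACCTCCT TTCAGACGCT CGACGACCTC GAACCTGGGC AACGGCTTCT GGTCCGCATC"
    dna = clean_sequence(first_line)
    print(len(dna), translate(dna))
```

To check the whole record, paste the full gene sequence into `clean_sequence`, then compare `translate(...)` with the cleaned protein sequence and `gc_percent(...)` with the GC content field.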