Gene Namu_2153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2153 
Symbol 
ID8447764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2374096 
End bp2375316 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content66% 
IMG OID645041276 
Productglucose-1-phosphate adenylyltransferase 
Protein accessionYP_003201520 
Protein GI258652364 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0448] ADP-glucose pyrophosphorylase 
TIGRFAM ID[TIGR02091] glucose-1-phosphate adenylyltransferase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0000824588 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00511814 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCCAAGC CCAAGGTCCT CGGCATCGTA TTGGCGGGTG GCGAAGGAAA AAGGTTGTGG 
CCGTTGACGG CCGACCGCGC CAAGCCTGCG GTGCCGTTCG GCGGTAACTT CCGCCTCGTC
GATTTCGTCC TGTCCAATAT GGTGAACGCC GGATACTTCC GGATCTGCGT GCTCACCCAG
TACAAATCGC ATTCGCTCGA CCGGCACATC ACGCAGACCT GGCGGATGAG CACGATGGCC
GGCAACTACG TGACGCCGGT CCCGGCCCAG CAGCGGCTCG GCCCGCGCTG GTACACCGGG
AGCGCGGACG CCATCCTGCA GTCGCTCAAC CTCGTCTATG ACGAAAAGCC CGACTACCTG
GTGGTTTTCG GTGCCGACCA CGTGTACCGG ATGGATCCGG CGCAGATGGT GGCCGACCAC
ATCGATTCGG GCGCCGACAC CACCGTGGCC GGGATCCGGG TGCCTCGCCG AGAGGCGACC
GCCTTCGGCG TGATCAAGAC GGCGGCCGAC GGCCGGCACA TCGCGGAGTT CATGGAGAAG
CCGGCCGACC CGCCGGCGGT GCCCGACGAT CCGGACGTGG CCTACGCGTC CATGGGCAAC
TACGTGTTCT CCACCGGTGC GTTGATCGAG GCCCTCAAGA TCGATGCGGC CGACGAGGCG
TCCGTGCACG ACATGGGCGG CAACATCATC CCGTACTTCG TGAACAAGGG CACCGCCAAC
GTCTACGACT TCGCCCGGAA CAAGGTGCCC GGGGCCACCG ACCGTGATCG CGGCTACTGG
CGGGACGTGG GCACGCTGGA CGCGTTCATG GACGCGCACA TGGACCTGAT CTCGGTCGAG
CCCATTTTCA ACCTGTACAA CCACGATTGG CCGATCCTGA GCTACCCGGC ACCCTTCCCG
CCGGCCAAGT TCGTCGAGGA CGGCACCGCG CGTGACTCGA TGATCGGCAC CGGCACGATC
ATCTCCGGGG CGACGGTGAC CCGGTCGGTC ATCGCCGAGG ACGTGCACGT CAACACCGGC
AGCCGGGTCG AGGGCTCGGT GATCATGCCG GGCGTGCGGA TCGGCCGCAA CGCGGTGGTG
CGGCACAGCA TTCTGGACAA GAACGTGATC GTGCCCGACG GCGCCAAGGT CGGCGTGGAC
GTCGAACTGG ACAGCGATCT GTACACGGTC AGCCCCGGTG GCATCACGGT CGTCGGCAAG
GGCGTGACGA TCGGCAAATA G
 
Protein sequence
MPKPKVLGIV LAGGEGKRLW PLTADRAKPA VPFGGNFRLV DFVLSNMVNA GYFRICVLTQ 
YKSHSLDRHI TQTWRMSTMA GNYVTPVPAQ QRLGPRWYTG SADAILQSLN LVYDEKPDYL
VVFGADHVYR MDPAQMVADH IDSGADTTVA GIRVPRREAT AFGVIKTAAD GRHIAEFMEK
PADPPAVPDD PDVAYASMGN YVFSTGALIE ALKIDAADEA SVHDMGGNII PYFVNKGTAN
VYDFARNKVP GATDRDRGYW RDVGTLDAFM DAHMDLISVE PIFNLYNHDW PILSYPAPFP
PAKFVEDGTA RDSMIGTGTI ISGATVTRSV IAEDVHVNTG SRVEGSVIMP GVRIGRNAVV
RHSILDKNVI VPDGAKVGVD VELDSDLYTV SPGGITVVGK GVTIGK