Gene Ndas_4902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4902 
Symbol 
ID9248789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp31755 
End bp32894 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content77% 
IMG OID 
Productgalactokinase 
Protein accessionYP_003682791 
Protein GI297563818 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.227753 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACGACG TGACGACCGC GTTCCACTCG GCGTTCGGGT ACGGGCCCGA GGGCGTGTGG 
ACCGCGCCGG GGCGGATCAA CCTCATCGGC GAGCACACCG ACTACAACGA CGGGTTCGTG
CTCCCCTTCG CCCTGCCCCA CGCGCTGACG GCCTCGGCCG CCCGGCGGAC GGACGGGCGG
GTGCGGCTGC TCTCCCGGCA GTCGCCGGAG GAGTACGCGG GCCCGGTGGG CGACCTGGTC
CCCGGCGCGG TGGAGGGGTG GGCCGCCTAC CCCGCCGGGG CGCTGTGGGT CCTGCGGGAC
GAGGGCCACC CGGTGGACGG GCTGGACCTG CTGGTGGACA GCACGATCCC GAGCGGGGCG
GGCCTGTCCT CGTCGGCCGC GCTGTCGTGC GCGGCCGTCA TGGCCGCGGC CTCCCTGTAC
GGGGCCGACC TCGCGCCGGG CGGGGTGGCC CGGCTGGCCC AGCGGGTGGA GAACGACTTC
GTGGGCATGC CCTGCGGGAT CCTGGACCAG TCCGCGTCCA TGCTCTCCAC CGAGGGGCAC
GCCCTGTTCA TGGACACGCG CACCCTGGAG ACCGAGCAGG TGCCCTTCGA CCCCTCCGCG
GACGGGCTGA CCGTGCTGGT GGTGGACACC CGCGCCCCGC ACCGGCACGT GGACGGCGCC
TACGCCGAGC GGCGCCGCTC GTGCGAGGAG GCCGCGCGCG TCCTGGGGGT GGCGGCCCTG
CGCGACGTCA CCGACCTGCC GGGCGCCCTG GCCGCGCTGC CCGACGACGT GTCCCGCCGC
CGGGTGCGCC ACGTGGTGAC CGAGAACGGG CGGGTGCTGC GGGCCGTGGA CCTGCTCCGG
TCCGGGCGCA CACGGGAGGT GGGGCCGCTG CTCACCGCCT CCCACGCCTC GCTGCGCGAC
GACTACGAGG TGAGCGTGCC CGAGGTGGAC ACCGCGGTGG ACGCGCTGCT GGCCGCGGGC
GCGCTGGGGG CCAGGATCAC CGGCGGCGGC TTCGGCGGGT GCGTGGTCGC CCTGGTGGAG
ACCGGGCGCG TGGAGGCCTG CGGGAAGGCG GTGCTGGAGG CCTACCGGGA GCGGGGCTTC
GAGGAACCGG CCGCGTTCGG TGCCCTGCCG TCCGCGGGGG CGCGCCGTCT GCACCCCTGA
 
Protein sequence
MDDVTTAFHS AFGYGPEGVW TAPGRINLIG EHTDYNDGFV LPFALPHALT ASAARRTDGR 
VRLLSRQSPE EYAGPVGDLV PGAVEGWAAY PAGALWVLRD EGHPVDGLDL LVDSTIPSGA
GLSSSAALSC AAVMAAASLY GADLAPGGVA RLAQRVENDF VGMPCGILDQ SASMLSTEGH
ALFMDTRTLE TEQVPFDPSA DGLTVLVVDT RAPHRHVDGA YAERRRSCEE AARVLGVAAL
RDVTDLPGAL AALPDDVSRR RVRHVVTENG RVLRAVDLLR SGRTREVGPL LTASHASLRD
DYEVSVPEVD TAVDALLAAG ALGARITGGG FGGCVVALVE TGRVEACGKA VLEAYRERGF
EEPAAFGALP SAGARRLHP