Gene Ndas_5208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5208 
Symbol 
ID9249101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp358087 
End bp359364 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content75% 
IMG OID 
Productglycosyl transferase group 1 
Protein accessionYP_003683094 
Protein GI297564121 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.330402 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.673241 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGATCG CGATGGTCGC CGAACACGCC AACCCCCTCC CCGCCCACAG GGGCGAGCCC 
GCCTGCCCCG CCAGCCTGCA CGTGTGCGCC CTGTCCCGGC AGCTGGCCAA GCGGGGCCAC
CGGGTGACGG TCTACGCCCG CCGCAGTGAC CCCGACCAGC CCGACGGCCG CACCCGCATG
GCGCGCGGCG TCTCCGTCGC CTACCTGGAC GCCGGCCCGG CCCGGCCGCT GTCCCCCGAG
GAGCACGCCG AGCACACCGG CGCCTTCGGC AGCGCCCTGG CCTCCGTCCT GGACGAGGAC
AGCCCCGACG TCCTGCACGC GATCGGCTGG ACCAGCGGCC TGGCCGCCCT GCACGCGCAG
GCGCACAGCG AGAGCGACCA GACCGGCACG CCCCTCGTGC AGACCTTCCA CTCCCTCAAC
GCCAGCGAGC AGCGCTCCGG CCTCGGCCAC CACCCCGAGC GCGCCCGCAT GGAGACCATC
CTCGCCTCGC GCGCCGACCG CGTGCTGGTC AACTCCACCG ACCAGCAGGT CGAGCTGGCC
CGCCTGGGCG TCCCCCGCCA CCACGTCAAC GTCGTGCCCT TCGGTGTGGA CCCCGACCAC
TTCAGCGTGG AGGGCAGCGC CTCCGCCGAG CACTGGCACT CCCGGCGCGA GGAGCGCGCC
CGCCTGGTCT CGGTCACCTC CCTGACCGAG GCCGGCGGCG CCGACCGGCT CGTGGAGGCC
ATGACCCGCC TCCCCGAGGC GGAGCTGCTG CTCGTCTCCA CCGCCGAGGA CCTGGACGTG
GCCCTGGACG AGAACGCCCG CCGGATCGAG CTCCTGGCCA AGGAGGCCGG GGTGAACGAC
CGCGTCCACC TGGCCGGGCC CGTGGAGCGC AAGGAGCTGC CGCGCCTGCT GCGCTCCGCG
GACGTGTACG TGTCCGCCGC CTCCTACGAC CCCTACGGCG GGGCCGTGCT GGAGGCCATG
GCGTGCGGCC TGCCCGTGGT GGCCACCGCC ACCGGCGCCA CCCCGGGGGC CGTCCTGCAC
CGCACCAGCG GCGTGCTGAT GCGCTTCGGC CGCCCCGACG AGGTCGTGCG CTCCGTGCGC
GCGGTCCTCA ACACCCCGAC CATGAGCACC GCGTACGGCA TCGCCGCCGT GGACCGGGCC
CGCTCCCGGT TCACCTGGCA GCGGATCGCC GTCGAGACGG AGCTCGCCTA CGAGCGCTCC
CGCCCGCAGC AGACGGAACA GGACCGCGCC GACGAGGACG AGACGGACGG TCTGCTACTG
TCCGGGACCG CGCACTGA
 
Protein sequence
MKIAMVAEHA NPLPAHRGEP ACPASLHVCA LSRQLAKRGH RVTVYARRSD PDQPDGRTRM 
ARGVSVAYLD AGPARPLSPE EHAEHTGAFG SALASVLDED SPDVLHAIGW TSGLAALHAQ
AHSESDQTGT PLVQTFHSLN ASEQRSGLGH HPERARMETI LASRADRVLV NSTDQQVELA
RLGVPRHHVN VVPFGVDPDH FSVEGSASAE HWHSRREERA RLVSVTSLTE AGGADRLVEA
MTRLPEAELL LVSTAEDLDV ALDENARRIE LLAKEAGVND RVHLAGPVER KELPRLLRSA
DVYVSAASYD PYGGAVLEAM ACGLPVVATA TGATPGAVLH RTSGVLMRFG RPDEVVRSVR
AVLNTPTMST AYGIAAVDRA RSRFTWQRIA VETELAYERS RPQQTEQDRA DEDETDGLLL
SGTAH