Gene Ndas_5198 details


Gene Information

Locus tag: Ndas_5198
Symbol:
ID: 9249091
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 342023
End bp: 343303
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: glycosyl transferase group 1
Protein accession: YP_003683084
Protein GI: 297564111
COG category:
COG ID:
TIGRFAM ID:
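
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(342023, 343303)
print(length, protein_length(length))
```

With the values in this record, the result agrees with the listed Gene Length (1281 bp) and Protein Length (426 aa).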


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.603506
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTCCACGT CCTTCGCCTC CCCCGCGGTG GCGGCACCGC CCCGGCCCGC CGTCGGCGCG 
ACTCGCCGCA CCCGGGTCCT GATCGGTACC GACACCTATC CCCCCGACGT GAACGGCGCC
GCGTACTTCA CCGCCCGCCT CGCCCGCGGT CTGGCCGCGC GCGGAGCGCG GGTGCACGTG
GTGTGCCCCT CCCCCGAGGG CGCCCCGTAC ACGGCGGAAC GCGGCGGGGT GGTCGAGCAC
CGGCTGCGCT CGGTGTCCTC CCTGGTCCAC GACAGCGTGC GGCTCGCGGT CCCGCTGGGC
GTGCGCGGCC ACCTGGACCG GCTCCTGGAC CGGGTGCGGC CGGACGCCGT CCACATCCAG
AACCACTTCC TCGTCGGCCG GATGCTGGCC GCCGCCGCGC ACGCCCGAGG CGTGCCCGTG
GTCGCCACCA ACCACTTCAT GCCGGAGAAC CTCTTCGACT ACGTGCACGT GCCCGCGCCG
CTGCGCCCGC ACGCGGCCCG GCTGGCCTGG TGGGACCTGG GCGCGGTGCT GTCCCGGGCC
GAGCACGTGA CCACGCCCAC CCCGGCGGCG GCGCGGCTGC TGGTCGACCA GGGGTTCACC
CGGCCGGTCG AACCGGTCTC GTGCGGGATC GACCTGGACC GGTTCAGCCC GCTGGACGGC
GGCGCGGCCA CCCGGCGGCG GCTGCGCGCC CGGCTGGGCG TGCCGGACCA CAGGACGGTG
CTGTTCGTGG GGCGGCTGGA CGAGGAGAAG CGCGTGGACG AACTCGTGCG CGCGGTGGCC
CTGACCGACG GGGTGCAGCT CGTGCTGGCC GGGCACGGCG CGCACCGGGC GCGGCTGGAG
GAGCTGGCGG CGGAGGTCGG CGCGGCCGAC CGGGTGGTGT TCCTGGGCTT CGTGCCGCAC
GCCGACCTGC CCGACGTGTA CCGGTGCGCT GACGTGTGGG CCATCGCCGG GACCGCCGAA
CTCCAGAGCA TCGCCACCCT GGAGGCGATG GCGAGCGGCC TGCCGGTGGT GGCGGCGGAC
GCCATGGCGC TGCCGCACCT GGTGGAGGAG GGCGGCAACG GGTACCTGTA CCCGCCCGGC
AGCCCGGGGG CGCTGGCCGC ACGCGTGGAG TCGGTGGTCG CCGACGAGGG CCGACGGCTC
GGGATGGGCG CGCGCAGCCG CGACATGGCG GAGCTGCACC GGCTGGAGGA CTCGCTGGAG
CGGTTCGAGC GGATCTACCG CGAGGCGTCC GCCGGTGCGG GGGCCGGCGC CGGAGCCGGT
GCGACGCGCA GCGGGCGGTG A
 
Protein sequence
MSTSFASPAV AAPPRPAVGA TRRTRVLIGT DTYPPDVNGA AYFTARLARG LAARGARVHV 
VCPSPEGAPY TAERGGVVEH RLRSVSSLVH DSVRLAVPLG VRGHLDRLLD RVRPDAVHIQ
NHFLVGRMLA AAAHARGVPV VATNHFMPEN LFDYVHVPAP LRPHAARLAW WDLGAVLSRA
EHVTTPTPAA ARLLVDQGFT RPVEPVSCGI DLDRFSPLDG GAATRRRLRA RLGVPDHRTV
LFVGRLDEEK RVDELVRAVA LTDGVQLVLA GHGAHRARLE ELAAEVGAAD RVVFLGFVPH
ADLPDVYRCA DVWAIAGTAE LQSIATLEAM ASGLPVVAAD AMALPHLVEE GGNGYLYPPG
SPGALAARVE SVVADEGRRL GMGARSRDMA ELHRLEDSLE RFERIYREAS AGAGAGAGAG
ATRSGR
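
The protein sequence can be reproduced from the gene sequence by codon-wise translation. Below is a minimal sketch, assuming the sequence is given on the coding strand and in frame. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the set of permitted start codons, so a standard codon table is used here; alternative start codons are not handled, which does not matter for this gene because it begins with ATG. `CODON_TABLE` and `translate` are illustrative names, not a library API.

```python
from itertools import product

# Standard codon table in TCAG order; the amino-acid assignments are the
# same in translation table 11 (only the start-codon set differs).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("ATGTCCACGT CCTTCGCCTC C"))
```

Passing the full gene sequence to `translate` should give a string that can be compared against the protein sequence listed above once its spaces are removed.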