Gene Ndas_3774 details


Gene Information

Locus tag: Ndas_3774
Symbol:
ID: 9247643
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4535753
End bp: 4536871
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 78%
IMG OID:
Product: glycosyl transferase group 1
Protein accession: YP_003681678
Protein GI: 297562704
COG category:
COG ID:
TIGRFAM ID:
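
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and a CDS that ends in a stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_length, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(4535753, 4536871)
print(length, protein_length_aa(length))
```

With the Start bp and End bp values from this record, the sketch reproduces the listed Gene Length and Protein Length.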


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.291858
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.904998
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCCCGG CGGCGCGGGT GGCCCTGGTC GGGCCCGCCC ACCCCTACAA GGGCGGCGGC 
GCCCGGCACA CCACCGAGCT GGCGCACCGG CTGAGCGCGC TGGGCCACCC CACGGCCGTG
GAGTCCTGGC GGGCGCAGTA CCCGGCCGCC CTCTACCCCG GACAGCAGAC CATCGAGGTC
CCCGAGGGCG AGCCCTACCC CGGCACCCGC CACGAGCTGG CCTGGTACCG GCCGGACGGG
TGGTGGCGGA CGGGGCGGCG CCTGGCCCGC GAGGCCGACC TGGTGGTGCT CACCCTGTTC
TCCCCGGTGC AGGTGCCCGC CTACCTCGGC GTCCTCGCCG GGGTGCGCTC CGTGCGGCGT
TCCGGGGGCG CGCGGGTGGT GGCGCTGTGC CACAACGTGC TGCCCCACGA ACGGCGTGCC
GTGGACGTGT CCCTCGTGCG GGCCCTGCTG CGGCGGGTGG ACGGCGTGGT CGCCCACTCG
CCCGAGCAGG CGCGCCTGGC CGAGGGGCTC GGGGGCCCCG GGGCGCGGCG GCCCGTGGTC
GCCCAGATGC CCCCGCACCT GCCCGAGACC GGCGGCGCGG TCGCCCCGGC GCCGGGGGAG
CGGCGGCACC TGCTCTTCCT GGGCATCGTC CGCCCCTACA AGGGGGTGGA CCTGCTGCTG
CGCGCCCTGG CCGACGGAGC GCCCGGGGAC GTGGCGCTCA CCGTGGCCGG GGAGTTCTGG
GGCGGCACAG CGGAGCTGGA GGAACTGGCC GCCGGGCTGG GGATCGCCGA CCGGGTGCGG
CTGCGGGACG GGTACGTCCC CGCCGCCGAG CTGCCGGAGC TGTTCGCCTC GGCCGACGCG
GTGGTGCTGC CCTACCGCAC CGCCACCGCC ACCCAGAACG TGTGGCTGGC GCACGAGCAC
GGGATTCCGG TGGTGGCCAC CCGGGCCGGG ACCCTCCCCG ACCACGTGCG CGAGGGGGTC
GACGGCCTGC TGTGCGCGCC GGGCGACGCC GCCGACCTGG CCCGCGCGCT GGGGGAGTTC
TACGCGCCGG GGGAGCCCGA GCGGCTGCGC GCCGGGGTCC GCCCGGTGGA GACCGAGCCG
TACTGGCGGG CCTACACCGA GCGGTTGCTC GGGGCCTGA
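
The gene sequence is printed in space-separated blocks of 10 bases, so it needs cleaning before its length or GC content can be computed. The following is a minimal sketch of that step; the helper names are hypothetical, and the example input is only the first line of the sequence above, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First line of the gene sequence only (60 bases), used as a small example.
first_line = clean_sequence(
    "GTGAGCCCGG CGGCGCGGGT GGCCCTGGTC GGGCCCGCCC ACCCCTACAA GGGCGGCGGC"
)
print(len(first_line), round(gc_fraction(first_line), 3))
```

Passing the full gene sequence through the same two functions gives values that can be compared with the Gene Length and GC content fields of the record.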
 
Protein sequence
MSPAARVALV GPAHPYKGGG ARHTTELAHR LSALGHPTAV ESWRAQYPAA LYPGQQTIEV 
PEGEPYPGTR HELAWYRPDG WWRTGRRLAR EADLVVLTLF SPVQVPAYLG VLAGVRSVRR
SGGARVVALC HNVLPHERRA VDVSLVRALL RRVDGVVAHS PEQARLAEGL GGPGARRPVV
AQMPPHLPET GGAVAPAPGE RRHLLFLGIV RPYKGVDLLL RALADGAPGD VALTVAGEFW
GGTAELEELA AGLGIADRVR LRDGYVPAAE LPELFASADA VVLPYRTATA TQNVWLAHEH
GIPVVATRAG TLPDHVREGV DGLLCAPGDA ADLARALGEF YAPGEPERLR AGVRPVETEP
YWRAYTERLL GA
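
The protein sequence can be cross-checked against the gene sequence by translating it with translation table 11. The sketch below is self-contained and makes two assumptions: the codon table is written in the NCBI base order T, C, A, G, and an alternative start codon in the first position (here GTG) is read as methionine. The function name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(raw: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    sequence = "".join(raw.split()).upper()
    protein = []
    for index in range(0, len(sequence) - len(sequence) % 3, 3):
        codon = sequence[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        # An alternative start codon in the first position encodes methionine.
        if index == 0 and codon in START_CODONS:
            amino_acid = "M"
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence in this record.
print(translate_cds("GTGAGCCCGG CGGCGCGGGT GGCCCTGGTC"))
print(translate_cds("TACTGGCGGG CCTACACCGA GCGGTTGCTC GGGGCCTGA"))
```

The two fragments translate to the first 10 and the last 12 residues of the protein sequence above. The final line can be translated on its own because the preceding 18 lines hold 60 bases each, so it starts in frame.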