Gene Ndas_3853 details

Gene Information

Locus tag: Ndas_3853
Symbol:
ID: 9247724
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4625698
End bp: 4626852
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: glycosyl transferase group 1
Protein accession: YP_003681756
Protein GI: 297562782
COG category:
COG ID:
TIGRFAM ID:
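
The two length fields can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (a common convention, but not stated in this record) and that Protein Length excludes the stop codon. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4625698, 4626852)
print(length, protein_length(length))  # 1155 384
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields above.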


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.208611
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.427246
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGCGC TCGTGGCCAC GGTGGTGCAC CACCCGGAGG ACGCACGGAT CCTGCACCGG 
CAGATCCGCG CCCTGTTGGA CGCCGGACAC AGCGTGACCT ATGTGGCGCC GTTCCGCGAG
TGCGGGGTGA CCCCCTGGTC GGAACTGCGC TCGGTGGACG TGCCGCGCTC CTCGGGGCGC
GAGCGGCTCG CCTCGCTGCG CGCCGCCCGC GCGGTGCTCG CCGAGCAGGC GCCCCTGGCC
GACCTGCTGC TCTTCCACGA CCCCGAACTC CTGATGGCCC TGCCGTCCAG ACGCCCGGTG
ACGGTGTGGG ACGTGCACGA GGACACGGCG GCGGCCCTGC TCACCAAGGC GTGGGTGCCC
CGGGCGCTGC GGCGTCCGCT GGGCACGGTG GTGCGCTCCT TCGAGCGGCA CGCGGAGCGG
CGGATGCGGC TGATGCTGGC CGAGGAGGGG TACCGCTCCC GGTTCCGCCT GGAGCACCCG
GTGGTGCCCA ACACCACCGA GGTGCCGGAG TTCCCGGCGC GCGAGCCGGG CGACGACCGG
ATCGTGTACC TAGGCCAGGT GTCCGAGGCG CGCGGCGCGC GCGAACTGGT GGAGCTGGGG
CGCATGCTGC GCCCGCACGG CGTGCGCCTG GAGGTGATCG GCGGGGCCGA CGCCGGGGTG
CGGCCGCTGC TGCGCGAGGC CCAGCAGGAG GACGTCCTGC ACTGGTACGG GTTCGTGCCC
AACGACCGGG CGCTGCGGAT CTGCGCGGGC GCCATGGCGG GGCTGAGCCT GCTGCACGAC
ACGCCCAACT ACCGGCACTC GATGCCGACC AAGGTCGTGG AGTACATGGC GCACGGCCTG
CCGGTGGTGA CCACGCCCAA CCCGATGGCA CAGGAGCTGG TGACCGGCCG TCCGGAGGGC
CCGTCGGGCC TGGTGGTGCC GTTCGGGGAC GTGTCGGCCG CGGCGGAGTC GGTGCTGCGG
CTGCGCCGGG ACGCGGAGCT GCGCCGGAAC CTGGCGCGCA CCGGGCACCG GATCGCGCGG
ACCTCCTTCC ACTGGCCGGT CCAGGCGCGC CTGTTCGTCA AGCGCCTGGA GGCGTGGGCG
GACGAGGCCT CCGGCGGGCC GCTGGCCGTC GTGCCGCCCC CGCGGAGTCG CCAGCGCACT
CCCGTGCGTG ACTGA
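
The gene sequence is printed in groups of 10 bases, 60 per line, so the grouping has to be removed before the sequence is used. A minimal sketch that does this and computes a GC percentage; the helper names are hypothetical, and how the database rounds its GC content value is not stated here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCACGCGC TCGTGGCCAC GGTGGTGCAC CACCCGGAGG ACGCACGGAT CCTGCACCGG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

To compare with the GC content field, pass the whole gene sequence rather than a single line.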
 
Protein sequence
MHALVATVVH HPEDARILHR QIRALLDAGH SVTYVAPFRE CGVTPWSELR SVDVPRSSGR 
ERLASLRAAR AVLAEQAPLA DLLLFHDPEL LMALPSRRPV TVWDVHEDTA AALLTKAWVP
RALRRPLGTV VRSFERHAER RMRLMLAEEG YRSRFRLEHP VVPNTTEVPE FPAREPGDDR
IVYLGQVSEA RGARELVELG RMLRPHGVRL EVIGGADAGV RPLLREAQQE DVLHWYGFVP
NDRALRICAG AMAGLSLLHD TPNYRHSMPT KVVEYMAHGL PVVTTPNPMA QELVTGRPEG
PSGLVVPFGD VSAAAESVLR LRRDAELRRN LARTGHRIAR TSFHWPVQAR LFVKRLEAWA
DEASGGPLAV VPPPRSRQRT PVRD
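
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration, not the database's own pipeline. It relies on the fact that NCBI translation table 11 assigns the same amino acids as the standard table and differs only in the permitted start codons. It does not convert an alternative start codon such as GTG to M, which is not needed here because this gene starts with ATG.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G, with the first position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record above.
print(translate("ATGCACGCGC TCGTGGCCAC GGTGGTGCAC"))  # MHALVATVVH
```

The output matches the first ten residues of the protein sequence listed above.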