Gene Ndas_3146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3146 
Symbol 
ID9247002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3763940 
End bp3765094 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content76% 
IMG OID 
Productglycosyl transferase group 1 
Protein accessionYP_003681061 
Protein GI297562087 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.833609 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGCGAA CTCTGATCAT CACCAACGAC TTCCCGCCCA AGGCCGGTGG CATCGAGGCC 
TTCGTCCACG AGATGGCCCT GCGCCGCCCC CGCGGATCGG TCGTGGTCTA CTGCTCCTCC
CCCGCCCGGG CCGACGCCGC CGCCGACCCG CACTTCGACC TGCGCCAGCC CTTCCCCGTC
GTGCGCGACG CCGCACGCGT CCTGCTGCCC ACGCCCCGGG TGGCGCGGCG GGCACGCGCC
ATCGCCGACC TGGAGGGCTG CGACACCGTG CTCTACGGCG CCGCCGCCCC GCTGGGACTG
CTCGCCGCCG GGCTGGGCGA GGGCACCCCC GTCAAGCGCC AGGTGGCCAT CAGCCACGGC
CACGAGACCT GGTGGGCCAC CATGCCCGGA TCCCGCGAGG CGCTGCGCCG CATCGGCGAC
ACCACCGACA CCGTCACCTA CCTGGGCGAG TACACCCGGC GCCACCTGGC CCGGGCCCTG
TCCCCCGACG CGGCCGCCCG CATGCGCCGG CTCACGCCCG GCGTGGACAC CGGGGCCTTC
CGGCCCGGCA CCGGTGGCGA GGAGGTCCGC GCGCGCCTGG GCCTGGGCGA CCGCCCCGTG
GTGCTGTGCG TGTCCCGGCT CGTACCGCGC AAGGGGCAGG ACACCCTGAT CCGCGCCTGG
CCCCGCGTCC TGGCCGACGT GCCCGAAGCG GTCCTGCTCG TCGTCGGCGA CGGCCCCCAC
CGCCGGAGCC TGCTCTCGGC CGCGCGCGGA ATGGACTCGG TGGTCTTCAC CGGCTCGGTC
CCCCATCGGG ACCTGCCGCC CTACTACGAC GCCGCCGACG TGTTCGCCAT GCCCTGCCGC
AGCCGCAAGG GCGGCCTGGA GGCCGAGGGG CTGGGCATCG TCTACCTGGA GGCCTCGGCC
TGCGGCCTGC CCGTGGTCGC GGGCGACTCC GGGGGCGCAC CCGCCACGGT CCGGGACGGC
GAGACCGGCC TGGTCGTGGA CGGATCCCTG CCCGGCCCCT CCGCGCGCGC CCTCATCGCC
CTACTGAAGG ACCCCGAGCG CGCCGCCCAG ATGGGCGCAC GCGGCCGCGC GTGGGTGAGC
CGTGAGTGGA CCTGGGAACA CACCGCCAGG CGCCTGGACG CCCTCCTGGA GGGCTCCCCG
GACCTGCCCG CCTAG
 
Protein sequence
MPRTLIITND FPPKAGGIEA FVHEMALRRP RGSVVVYCSS PARADAAADP HFDLRQPFPV 
VRDAARVLLP TPRVARRARA IADLEGCDTV LYGAAAPLGL LAAGLGEGTP VKRQVAISHG
HETWWATMPG SREALRRIGD TTDTVTYLGE YTRRHLARAL SPDAAARMRR LTPGVDTGAF
RPGTGGEEVR ARLGLGDRPV VLCVSRLVPR KGQDTLIRAW PRVLADVPEA VLLVVGDGPH
RRSLLSAARG MDSVVFTGSV PHRDLPPYYD AADVFAMPCR SRKGGLEAEG LGIVYLEASA
CGLPVVAGDS GGAPATVRDG ETGLVVDGSL PGPSARALIA LLKDPERAAQ MGARGRAWVS
REWTWEHTAR RLDALLEGSP DLPA