Gene Ndas_3020 details

Gene Information

Locus tag: Ndas_3020
Symbol:
ID: 9246873
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3607490
End bp: 3608572
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 78%
IMG OID:
Product: glycosyl transferase group 1
Protein accession: YP_003680936
Protein GI: 297561962
COG category:
COG ID:
TIGRFAM ID:
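
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming the stop codon is part of the gene."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


length_bp = gene_length(3607490, 3608572)
print(length_bp)                  # 1083
print(protein_length(length_bp))  # 360
```

Under those assumptions the computed values agree with the Gene Length (1083 bp) and Protein Length (360 aa) fields.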


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.506245
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAACGCCA GGATCGCGCT GGTCATCGGT ACCAGCAGTG GAGGGGTGGG CCGCCATGTG 
CGCTCGCTGG GCGCGGGCCT GGCCGCGCGC GGCCACCGGG TGGCGGTGCT GGGCCCGGCC
TCCGCCGAGC GCGAGTTCGG CTTCACCTCG GAGGGCATGC GCTTCTCCCC GGTCGGCATC
GGAGCGGCGC CCTCCGCGGG CGATCCCGGC GCGGTGCTGC GCCTGCGCGC CCTGACCCGG
GGCGCGGACG TGGTCCACGC GCACGGGCTG CGGGCCGGGG CCCTGTGCGC GCTGGCCGGC
GCCTCTCCGC TGGTGGTGAC CGCGCACAAC GCGCCGCCGC TGGTGCGGGG GGCGCTGTCG
GCGGCCTACC CGGTGCTGGA GCGGATCGTG GCGCACCGGG CGGACGTGGT GCTGGGGGTG
TCGGGCGACC TGGTGCGCAG GCTGCGCTCG GTCGGGGCGC GCGACGCGCG GCTGGCGGTG
GTGGCCGCTC CCGAGACCGG CGCTCCGGTG AACGGGCGCG AGGCCACCCG GGCCGACCTG
GCGGTGCTGC CGGAGCGGCC GCTGCTGCTG ACCGTCGCCC GCCTGGCCGA GCAGAAGGGG
TTGGACATGC TCCTGGCGGC GGCGCCTGCC ATCGCCGACC GCCGCCCCGA ACCGGTGGTG
GCGATCGCCG GGGACGGGCC CCTGTGGGGG CAGCTGCACG ACACGGCCGC GGAGATGCGC
GCGGACGTGC GCATGCTGGG GCACCGCGCG GACGTGGCGG ACCTGCTGGC GGCGGCGGAC
GTGTTCTGCC TGACCAGCCA GTGGGAGGGG CCCTCACTGG TGATCATGGA GGCGTTGCGC
GCGGGGCTGC CGGTGGTCTC CACGCGGGTC GGCGGCATCC CGGACCTGTA CTCGGGGACG
GTGCTGATGG TGCCGCCGGG GGATCCCGCG GCCTTCGCCG CCGCCGTGGG CCGGGTGCTG
GACGACCCGG CGCTGGCCGA GGACCTGCGG GCGCGCTCGC GCGAGGCGGC CAAGGCGCTG
CCGAGCGAGG AGGACGCGGT GGAGGCCGCC GCGGGCGTGT ACAAGACGGT GCTGCGGCGG
TGA
 
Protein sequence
MNARIALVIG TSSGGVGRHV RSLGAGLAAR GHRVAVLGPA SAEREFGFTS EGMRFSPVGI 
GAAPSAGDPG AVLRLRALTR GADVVHAHGL RAGALCALAG ASPLVVTAHN APPLVRGALS
AAYPVLERIV AHRADVVLGV SGDLVRRLRS VGARDARLAV VAAPETGAPV NGREATRADL
AVLPERPLLL TVARLAEQKG LDMLLAAAPA IADRRPEPVV AIAGDGPLWG QLHDTAAEMR
ADVRMLGHRA DVADLLAAAD VFCLTSQWEG PSLVIMEALR AGLPVVSTRV GGIPDLYSGT
VLMVPPGDPA AFAAAVGRVL DDPALAEDLR ARSREAAKAL PSEEDAVEAA AGVYKTVLRR
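
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes translation table 11 (the bacterial, archaeal and plant plastid code), in which codon-to-residue assignments match the standard code and an alternative start codon such as TTG is read as methionine at the first position. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the code is a sketch rather than the database's own procedure.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11 (assumption: NCBI definition).
START_CODONS = {"ATG", "TTG", "CTG", "GTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format sequence blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    residues = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if index == 0 and codon in START_CODONS:
            residue = "M"
        residues.append(residue)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "TTGAACGCCA GGATCGCGCT GGTCATCGGT"
print(translate(first_blocks))  # MNARIALVIG
```

The printed residues match the first block of the protein sequence, including the leading M that arises from the TTG start codon. Passing the full gene sequence to `translate` and `gc_fraction` would give the complete protein and the GC fraction for comparison with the record.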