Gene Ndas_4053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4053 
Symbol 
ID9247925 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4847909 
End bp4849126 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content74% 
IMG OID 
ProductROK family protein 
Protein accessionYP_003681955 
Protein GI297562981 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGAAG ACGCGCTGAC CCACACGGAG GGACGACCCG GCCCGGCCGT CGCTCCGACC 
GCCGCACCCG GTGCGGGCAC CCTGCTCCGA CTCCTGCGCG ACGGCCGACC GCGCACCCGT
TCGGAGCTGG CCTCGGTCAC CGGTCTGGCC CGCTCGACCG TCACCCAGCG CGTGGACGCC
CTGCTCGCCA GCGGACTCAT CGGGCCCGCC GGAGAGGCCG TGTCCACCGG GGGCAGGCCG
CCCACGACCT TCTCGTTCCA GCCGGACGCC CGCGTGGTCC TGGCCGCCGA CCTGGGCGCC
ACCCACGCCC GCCTGGCCCT CACCGACATG TCGGGGACCG TGCTCGCCGA GGAGCGCGCC
GACCTAGACA TCGCCCTGGG CCCCGAGCAC GTCCTGGACT GGGTCGTGGA GCGGGGCCGC
GGCCTGCTGG AGGGCACGGA ACGGCGGGTG GACGAACTGC TCGGCCTGGG CATCGGCCTG
CCCGGCCCGG TCCAGCACAC CACCGGGCGC GCGGTGAACC CGCCGATCAT GCCGGGGTGG
GACAACTTCG ACGTGCCCGG CTACGTCCAC GACCGGCTCG ACGTGCCGGT CCTGGTGGAC
AACGACGTCA ACATCATGGC GATCGGGGAG CACCACACGG CCTGGCCCGA GGCCAGCCAC
CTGATGTTCG TCAAGGTCGC CACGGGCATC GGCTGCGGCA TCGTCAGCGA GGGCAGCGTC
TACCGGGGAG CGCAGGGCGC GGCCGGGGAC ATGGGGCACA TCCACGTCCC CAGCGGCGAC
GAGCGGCCCT GCCGCTGCGG CAACACCGGC TGCCTGGAGG CCGTGGCCAG CGGCGCCGCC
CTGGCCGCCG CGCTCACCGC CGAGGGCGTG CCCGCCAGCG GAGCCCGCGA CGTGGTCGAA
CTGTCCCGCA ACGGGTCGGT GCCCGCCCTG CGCGCGCTGC GCCAGGCGGG CCGCGACATC
GGCGAGGTGC TCGCCGCGTC GGTGAACATG TTCAACCCGT CGGTCATCGT CATCGGCGGC
GCGCTCGCCC TGGCCGGGGA CCACCTGCTC GCCGGGGTGC GCGAGATCAT CTACCAGCGG
TCGCTGCCGC TGGCCACCGA ACACCTGAGC ATCGTCTCGT CCGTGGCGGG CGAGAGCGCG
GGCGTGGTCG GCGCGGCGGT CATGGTGATC GAGCACTGCC TGAGCCCCGA GCACGCGGAC
GCGCTGATCA ACGGCTGA
 
Protein sequence
MTEDALTHTE GRPGPAVAPT AAPGAGTLLR LLRDGRPRTR SELASVTGLA RSTVTQRVDA 
LLASGLIGPA GEAVSTGGRP PTTFSFQPDA RVVLAADLGA THARLALTDM SGTVLAEERA
DLDIALGPEH VLDWVVERGR GLLEGTERRV DELLGLGIGL PGPVQHTTGR AVNPPIMPGW
DNFDVPGYVH DRLDVPVLVD NDVNIMAIGE HHTAWPEASH LMFVKVATGI GCGIVSEGSV
YRGAQGAAGD MGHIHVPSGD ERPCRCGNTG CLEAVASGAA LAAALTAEGV PASGARDVVE
LSRNGSVPAL RALRQAGRDI GEVLAASVNM FNPSVIVIGG ALALAGDHLL AGVREIIYQR
SLPLATEHLS IVSSVAGESA GVVGAAVMVI EHCLSPEHAD ALING