Gene Ndas_1499 details


Gene Information

Locus tag: Ndas_1499
Symbol:
ID: 9245349
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1837908
End bp: 1839278
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: formyl transferase domain protein
Protein accession: YP_003679435
Protein GI: 297560461
COG category:
COG ID:
TIGRFAM ID:
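
The gene and protein lengths above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS includes its stop codon (both are assumptions, not stated in the record). The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1837908, 1839278)
print(length_bp)                  # 1371
print(protein_length(length_bp))  # 456
```

Both values agree with the Gene Length and Protein Length fields in the record.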


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.290814
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000921551
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCAGACC GTCGGCGCTA CTGCTACGCC TCCGGGCTGG GCCTCGGTGT TCCGGCGCTG 
GAGGAGCTGT GCGCCCGAGG CTTCCCGCCC GGTCTGGTGG TGTCCCACCC CGCCGAGTTG
GCCCACTGCT CCGGCTACCA CGACTACGGG GCCCTCGCCG ACCGGCTCGG TCTGCCGCAC
CTGCGCGCCG CCCTCGACTC GGGCGAGGTG CGCGAGGCCC TCACCTTCCA CGGCATCGAC
CTGATGGTCG TCGCGGGCTG GTCGGGAACC GTCCCGGAGG AGGTCCTGTC CTCCCTGGCT
CTGGGCGGGG TCGGGCTGCA CCCGGCGCCG CTGCCCGTCG GCCGGGGCCG GGCGCCCATC
CCGTGGACGA TCCTGCGCGA CATGCGCTCC AGCGCCGTGA CCCTCTTCCA CATCGAAGGG
GAGGAGCACA GCGGCGACAT CGTCGACCAG GCCTGGTTCG ACGTGGCCCC GGACGCCACC
GCCGCCGGTC TGTACGAACG CGTGGGGCTC CTCCAGGCGG AACTGCTCGT GCGCCACATG
GAGGGCCTGC TGGAGGGCAC CGCGCCCCGG CGGCCGCAGA GCGGCCACGC GTCGGTGTGG
CCGCGCCGGC GCCCCAGCGA CGGCCACCTC GACCTCACCG CCTCCGGAAG CGACGTGGAC
CGGATGGTGC GCGCGCTGGC CGAGCCCTAC CCGGGGGCGT TCGCGATGTT CGGCAGCGCC
CGGATCACGC TGTGCTCGGG ACGCCTGGTG GGCGGGGTCG CCGGCGGGGC GCCCGGGCAG
GTGGTCGCGA CCGGCCGGGG GCGGGAGTGG GGGATCACCT GCGGGGACGG GGCCGTGTTC
GTGCCCGAGG TGCTGCGGGT GGACGAGGGG GTGCGCGCCC GGCCGACCTC GTTGGCGATG
TTCCGGCCCG GGACCTTCTT CGAGGCCCCC TCCCAGCACA TGCTGGAGGG CACCCGGCGG
GCGCCCCTGC CCGGACAGGC GCCGAGCGGA CCGAACAGGG TCGTGCCCGC CGCCCGGACC
GCGCCGGAGG CACGGGCCGC GGAGCCCGGG GTCCCGGGGG CGGGGGAGGG CGGCGCGCGG
TCGCGGGCTC CGGCCCCGGA GGAGGGGGCT TCCGGGGCGA ACGTGTCGGG GGCACCGGAC
GCGTCGGGGA CTTCGGGTGC GGCCGGGGAG GCCGGGGCGG AGGTTCCGGA GGCCCGGGGG
TCCGAGCAGC GGGTTTCTGA GGGGCGCGTC CCGGAGGCGC AGGCGCCGCA GGCAGGGGTT
CCCGACACGC GGGGGCCGGA GCCCCAGACC CCGGAGGCGC AGGCGGGGGA CGGTACGGAC
CTCGCGCCCG GGATCATGGC GGGCGACCCC GGTCCGGCCG ACCAGCGCTG A
 
Protein sequence
MSDRRRYCYA SGLGLGVPAL EELCARGFPP GLVVSHPAEL AHCSGYHDYG ALADRLGLPH 
LRAALDSGEV REALTFHGID LMVVAGWSGT VPEEVLSSLA LGGVGLHPAP LPVGRGRAPI
PWTILRDMRS SAVTLFHIEG EEHSGDIVDQ AWFDVAPDAT AAGLYERVGL LQAELLVRHM
EGLLEGTAPR RPQSGHASVW PRRRPSDGHL DLTASGSDVD RMVRALAEPY PGAFAMFGSA
RITLCSGRLV GGVAGGAPGQ VVATGRGREW GITCGDGAVF VPEVLRVDEG VRARPTSLAM
FRPGTFFEAP SQHMLEGTRR APLPGQAPSG PNRVVPAART APEARAAEPG VPGAGEGGAR
SRAPAPEEGA SGANVSGAPD ASGTSGAAGE AGAEVPEARG SEQRVSEGRV PEAQAPQAGV
PDTRGPEPQT PEAQAGDGTD LAPGIMAGDP GPADQR
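
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for turning such a listing back into a contiguous string and computing its GC fraction is shown below. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the input is a toy example rather than the gene itself.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


toy_block = "ATGC GGCC\nTTAA"
toy_seq = clean_sequence(toy_block)
print(toy_seq)               # ATGCGGCCTTAA
print(len(toy_seq))          # 12
print(gc_fraction(toy_seq))  # 0.5
```

Applied to the gene sequence listed above, the cleaned length should match the reported Gene Length of 1371 bp, provided the listing is complete.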