Gene Ndas_4726 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4726 
Symbol 
ID9248608 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5609595 
End bp5610851 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content73% 
IMG OID 
Productprotein of unknown function DUF1205 
Protein accessionYP_003682618 
Protein GI297563644 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGAATCG TCGTCGCGGC TTACGCGGAC AAAGCTTATT TCTTTAGCAT GGTGCCGACG 
GCGTGGGCAC TGCGCGCCGC CGGGCACGAG GTCCGGATCG TCACCCAGCC CTCCATGACC
GAGGCCGCGG CGGAGACGGG CCTGACCGTG GTCCCGGTCG GCGGCGACCA CACCCTGGCC
GAGGTGCTGG CCCACGCCCG CGATCAGCAG GGCGAGTCGA TCTTCGACCT GGCCGAGGAG
CGGCCGGAGA TGCTGGTGCC GGAGAAACTG CACCACGCCT ACGAGGAGTA CGTCACCTGG
TGGTGGAAGC TCGTCAACGA GCCGATGGAG CGGGACCTGG TCGCCTTCTG CCGCGAGTGG
CGCCCCGACC TCGTGCTGTG GGAGCCCAAC ACCTACTCCG CGGCGATCGC CGCGGAGGCG
TGCGGCGCCG CGCACGGGCG GTTCCTGTGG AGCGTGGACC TCTTCTCGCG CATGCGGCGC
CTGTACCTGG GGGCCGCCGA CGCCACCCCC GGACCCGACC CGCTCAGGAG CTGGCTGGAG
GAGAGCGCGG ACCGGCACGG GGTGGCGTTC TCCGAGGACC TCGTACTCGG CCAGTTCTCC
GTCCACCAGA TCCCGGAGGC GCTGCGGCCG CGCGAACTGG AGAAGACGGG CACCCACCTG
AGCGTGCGCC CGGTTCCCTA CGCCGGAAGC GCCGTTCTGC CCTCCTGGGC ACGGGCCGGG
TCCGAGCGGC GGCGCGTCCT GGTGGACTGG GGGTCCTGGA GCAGGACGGC CGAGGGCGCC
GCCGCCCTGG TGGACGTCAT CGACGCCTGC GCCGAGATCG GCGCCGAGGC CGTCGTCCTC
TCCCCCGCCT CCCGCAGGGA CTCCCTGCCC GCCCTGCCCG AGGACGTGGT GGTGACCGAC
TCGGGGGCGG CCCACATGCT CATGGGCTCC GGCTCGCTGA TCGTCCACGG CGGCGGCTTC
GACGTGTGCT GCAACGCCGT GGTCGAGGGA CTGCCGCAGC TGGTCGTGCT CAACACCGAG
CAGTTCGACG CCGCTCCGCT CTCGCGGGCG CTCCGGGAGC GCGGGGCCGC GCGCGTACTG
GCGGTGGAGG AGGTCCTCAC CCGGGGCGTG GACGACCTCC TCACGGAGCT CCTGGACAGC
GGGGAGGTAC GCGCGGCGGC CGGGCTGGTG CGCGACGAGG CCCTGGCCGT GCCCGCCCCG
GACCAGGTGG TGCCGGAGCT GGAGCGCATC GCGGCGGCCC GTGGCGGCGG CGTCTGA
 
Protein sequence
MRIVVAAYAD KAYFFSMVPT AWALRAAGHE VRIVTQPSMT EAAAETGLTV VPVGGDHTLA 
EVLAHARDQQ GESIFDLAEE RPEMLVPEKL HHAYEEYVTW WWKLVNEPME RDLVAFCREW
RPDLVLWEPN TYSAAIAAEA CGAAHGRFLW SVDLFSRMRR LYLGAADATP GPDPLRSWLE
ESADRHGVAF SEDLVLGQFS VHQIPEALRP RELEKTGTHL SVRPVPYAGS AVLPSWARAG
SERRRVLVDW GSWSRTAEGA AALVDVIDAC AEIGAEAVVL SPASRRDSLP ALPEDVVVTD
SGAAHMLMGS GSLIVHGGGF DVCCNAVVEG LPQLVVLNTE QFDAAPLSRA LRERGAARVL
AVEEVLTRGV DDLLTELLDS GEVRAAAGLV RDEALAVPAP DQVVPELERI AAARGGGV