Gene Ndas_2946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2946 
Symbol 
ID9246799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3519408 
End bp3521060 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content74% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003680862 
Protein GI297561888 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.288376 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTGCTT CTTTACTTCT GGGGCCCCTG CTCAGGCACC CGGGGGAGAC CACCGCCACA 
GTGTGGGTGG AGACGGACGC CCCCTGCGCG GTGCGTGTCG TGGTCGGCGG CGCCGCCGAG
GCGACCGCCC GGACCTTCAC CGTGCACGGC CACCACTACG CCGTCTGCAC GGTCGCGGGG
CTGGTCCCGG GCTCCCGCCT GCCCTACGAG GTGTTCCTGG ACGAGGACCG GGTCTGGCCG
GAGCCGGACA GCCCCTTCCC GCCCAGCACC GTGCGCACCG TGGACCCGGA GGCGCCCACG
CGGCTGCTGT ACGGGTCCTG CCACACGCCC ACCCACGACA CTCCCGAGGG CGTGGTCCGC
TACGGCCCCG ACATGCTGCG CGCCACGGCC CGGCGGCTGG CCCGCGAACC GGTCGGGGGC
AACCTGGCGC TGCTGCTCAT CGGCGACCAG GTCTACGCCG ACGAGGTGCA GGAGTCGATG
CTCGCGTTCC TGCGCGAGCG TCGCGTCCGC GGGGGCGCCC GCGACGACCC CGACGACGAG
GTCGTGTACT ACGACGAGTA CGCCGAGCTG TACCGGCAGG CCTGGAGCGA CCCGCAGGTG
CGGTGGCTGC TGTCCACCGT GCCGACCCTG ATGGTCTTCG ACGACCACGA CGTCCGCGAC
GACTGGAACA CCTCGGCCGC CTGGCGGCGC GCCATGGACC GCCAGCCCTG GTGGCGCAGG
CGCATCACCA GCGGTCTGGG CTCCTACTGG GTGTACCAGC ACCTGGGCAA CCTCTCCGAG
CAGGAGAGGG AGGGCGACCT GCTGTGGAAG CGGGTGCGCG ACGCCGACGG CGACGCCGAG
GACCTGGTCG ACGCCTTCGC CTGGCAGGCC CACAGCGAGC CGTCGAGCTA CCAGTGGGGG
CACCACCACG ACTTCGGCGG TGTCCGGCTC GTGATGGCCG ACACCCGCTG CTCCCGCGCC
CTGGGCGAGG GCGACCCCTC CGACGGGTCC AGGTCGATCC TGGGGCCCCA GGGCCACGAC
TGGCTGGACG GGCACCTGAC CGGCGGGCCG GACCACGTCG TGGTCGCCTC GACCGTCCCG
GTGCTGCTGC CCCCGGCCGT GCACCGGTTG GAGGCCTGGA ACGAGGCGGT GTGCGCGGGC
GCCTGGGGGC GGTGGCTGGC GGGGCCCGCC GAGCGGCTGC GCCAGGACAT CGACCTGGAG
CACTGGGCCG CCTTCCAGTA CTCCTTCCGC CGACTGTCCG ACACCGTGGG CGAACTCTCG
CGGGGCGAAC GGGGCCCGGC CCCGGCCACC GTGCTGTTCC TGGGCGGGGA CGTGCACTTC
TCCTACCTGG CGCGCGCCCG CCACCGCGGG GGCGGCGCGA GCCGGGTGAC CCAGCTGGTC
TCCTCCCCGC TGTGCAACCA GGCCCCGACG AGCATGCGCC GCATGGTCCG GCTGTCGGTG
AGCCGCCCGC TGCGCGCGAT CGGGTGGCTG CTGACGCGGC TGGCGGGGGT ACCCGAGCCC
GATCTGCGCT GGGACCTGGG GTCGGCCCCC TACTTCGGCA ACACCCTGGG CCAGGTGGAC
TTCGACGGGC GCGCGGCCAG GGCCTCCTGG TACCACTGCG CCCAGGGCGG GGGCGACGCG
CTGCCCGACG TGCGCATGAC CGCGGACCTC TGA
 
Protein sequence
MTASLLLGPL LRHPGETTAT VWVETDAPCA VRVVVGGAAE ATARTFTVHG HHYAVCTVAG 
LVPGSRLPYE VFLDEDRVWP EPDSPFPPST VRTVDPEAPT RLLYGSCHTP THDTPEGVVR
YGPDMLRATA RRLAREPVGG NLALLLIGDQ VYADEVQESM LAFLRERRVR GGARDDPDDE
VVYYDEYAEL YRQAWSDPQV RWLLSTVPTL MVFDDHDVRD DWNTSAAWRR AMDRQPWWRR
RITSGLGSYW VYQHLGNLSE QEREGDLLWK RVRDADGDAE DLVDAFAWQA HSEPSSYQWG
HHHDFGGVRL VMADTRCSRA LGEGDPSDGS RSILGPQGHD WLDGHLTGGP DHVVVASTVP
VLLPPAVHRL EAWNEAVCAG AWGRWLAGPA ERLRQDIDLE HWAAFQYSFR RLSDTVGELS
RGERGPAPAT VLFLGGDVHF SYLARARHRG GGASRVTQLV SSPLCNQAPT SMRRMVRLSV
SRPLRAIGWL LTRLAGVPEP DLRWDLGSAP YFGNTLGQVD FDGRAARASW YHCAQGGGDA
LPDVRMTADL