Gene Ndas_4849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4849 
Symbol 
ID9248735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5746110 
End bp5747609 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content72% 
IMG OID 
Productpolynucleotide adenylyltransferase/metal dependent phosphohydrolase 
Protein accessionYP_003682738 
Protein GI297563764 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGAACA CAGAGTTTTC GCGGGAGAAC GACGCCGCCC AGCAGCCGGA CGCCGGGGAC 
GCGACCGGTG CGGGCGCGCT GACCGGTGCG CAGCGCGCCG CGGTCACGGC GTTGGCGGAC
TCCTTCCCGA CCGTCTCGGA GGAACTCGGC GAGCGCTTCG CCGCGCACGG CCAGCAGCTC
GCCCTCGTCG GCGGACCGGT GCGCGACGCC CTCCTGGGCA GGCCGGCCAA CGACGTGGAC
CTGACCACGG ACGCGGTGCC CCAGCGCATC CTGGAGCTCG TGGACGGCTG GGCCGACTCC
GTGTGGACCG TCGGCATCGA CTTCGGCACC GTGGGCCTGC GCAAGAAGGG CCTCCAGCTG
GAGGTCACCA CCTACCGCAG CGAGTCCTAC TCGCCCAAGT CCCGCAAGCC CGAGGTGGCC
TACGGCACCG ACATCCACGA CGACCTGCTG CGCCGCGACT TCACCGTCAA CGCCATGGCG
GTGCGCCTGC CCGGCCTGGA GTTCGTCGAC CCGTTCGGCG GCCTGGCCGA CCTGCGGGCC
AAGGTGCTGC GCACCCCCGG CAGGCCCGAG GACTCCTTCA GCGACGACCC GCTGCGCATC
ATGCGCGCGG TCCGCTTCGC CGCCCAGCTC GGCTTCACCC TGGCTCCCGA GGTGGCCGAG
GCCGCCCGGG ACATGGCCGA CCGGCTCTCC ATCGTCTCCG CCGAGCGCGT CCGCGACGAG
CTGACCAAGC TCATGCTCAG CCCCGACCCG CACCGCGGCA TCGAGCTCAT GGTGGACCTG
GGCATCGCCC GGTACGTGCT CCCGGAGATC CCCAAGCTGC GCCTGGAGAT CGACGAGCAC
CACCGGCACA AGGACGTCTA CGAGCACTCG CTCACCGTCC TGGACCAGGC GATCGAGCTG
GAGGAGAAGC GGGGCCACGA GCCCGACCTG GTGCTGCGCC TGGCCGCGCT GCTGCACGAC
GTCGGCAAGC CCAAGACGCG CGCCTTCGAG TCCGGGGGGC GGGTGACCTT CCACCACCAC
GAGGTGGTGG GCGCCTCGAT GAGCCGCAAG AGGCTCACCG CGCTGCGCTT CCCCAAGGAC
GTGGTGTCCG ACGTCAGCAC CCTGGTCGAA CTGCACCTGA GGTTCCACGG CTACGGCAGG
GGCGAGTGGA CCGACTCCGC GGTCCGCCGG TACGCCCGCG ACGCCGGTTT GCAGCTGGAG
CGGCTGCACA TCCTCACCCG TGCCGACTGC ACCACCCGCA ACCGCCGCAA GGCGGCGGCG
CTGGCGCGCT CCTACGACGA CATCGAGCGG CGCATCGAGC TGCTCGCCGA GCAGGAGGAG
CTGGACCGCA TCCGCCCGGA CCTGGACGGC AACGAGATCC AGGAGATCCT GGGGGTCAAG
CCGGGTCCCG TGGTCGGCCG CGCGTACCGC TTCCTGCTGG AGCTGCGCCT GGAGAACGGC
CCGATGGGCA GGGAGGCCGC GACCGAGGAG CTGCGCGCCT GGGCGGCCGA GCACCTGTGA
 
Protein sequence
MPNTEFSREN DAAQQPDAGD ATGAGALTGA QRAAVTALAD SFPTVSEELG ERFAAHGQQL 
ALVGGPVRDA LLGRPANDVD LTTDAVPQRI LELVDGWADS VWTVGIDFGT VGLRKKGLQL
EVTTYRSESY SPKSRKPEVA YGTDIHDDLL RRDFTVNAMA VRLPGLEFVD PFGGLADLRA
KVLRTPGRPE DSFSDDPLRI MRAVRFAAQL GFTLAPEVAE AARDMADRLS IVSAERVRDE
LTKLMLSPDP HRGIELMVDL GIARYVLPEI PKLRLEIDEH HRHKDVYEHS LTVLDQAIEL
EEKRGHEPDL VLRLAALLHD VGKPKTRAFE SGGRVTFHHH EVVGASMSRK RLTALRFPKD
VVSDVSTLVE LHLRFHGYGR GEWTDSAVRR YARDAGLQLE RLHILTRADC TTRNRRKAAA
LARSYDDIER RIELLAEQEE LDRIRPDLDG NEIQEILGVK PGPVVGRAYR FLLELRLENG
PMGREAATEE LRAWAAEHL