Gene Ndas_2018 details


Gene Information

Locus tag: Ndas_2018
Symbol:
ID: 9245868
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2438549
End bp: 2439709
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: protein of unknown function DUF993
Protein accession: YP_003679950
Protein GI: 297560976
COG category:
COG ID:
TIGRFAM ID:
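
The length fields in the table above follow from the coordinates. The short Python sketch below checks that arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the last codon is a stop codon that is not counted as a residue; the variable names are illustrative and not part of the source page.

```python
# Values copied from the Gene Information table.
start_bp = 2438549
end_bp = 2439709

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and does not encode a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1161 386
```

Both values agree with the Gene Length and Protein Length entries listed for this gene.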


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.267016
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGCACAC TCCTGCTACC CGCCGCCGAC GGCACCGCCG CCCCCTACAC CCTGAGCGGC 
ACCCCCGTCG ACGCCTCCCC GCTTCCCCCC GCCCGCTCGC GTACGGCCTA CGCGGCGGCC
CACGTGGTCG CCGACCCCCT GGCGGCCAAC GCCCCCGGCG CCCCGGCCCG CCTGGACTGG
GAGGCCACCC TGGCCTTCCG CCACCACCTG TGGGACCAGG GCCTGGGCGT GGCCGACGCC
ATGGACACCG CCCAGCGCGG CATGGGCCTG GACTGGGCGG CCACCGCCGA GCTGATCCGC
AGGTCGGGTG CGGAGGCCGC CTCGCGCGGC GCCGCCCTGG CCTGCGGGGT GGGCACCGAC
CAGCTCGACC TTTCGCAGGC CGATCTGGAA AGCGTTATCA CCGCGTACGG TGAGCAGCTC
GACGTCGTCC AGGGCGCCGG AGCCACCCCC ATCCTCATGG CCTCGCGCGC CCTGGCCGCG
GTCGCCTCGG GCCCCGACGA CTACGCCAAG GTCTACGCCG ACCTGCTGGG CCGGGCCGAC
CAGCCCGTCA TCCTGCACTG GCTGGGCACC GCCTTCGACC CGGCCCTGGC CGGGTACTGG
GGCTTCGCCG ACCCGGCCGA GGCGATCGAG CCGGTGGCGG CCCTGATCGC CGAGCACGCG
GCGAAGGTGG ACGGCATCAA GGTCTCCCTG CTCGACGCCT CGCTGGAGGT GCGGCTGCGG
CGGCTGCTGC CCGAGGGCGT GCGCCTGTAC ACCGGCGACG ACTTCAACTA CCCCGACCTC
GTCCTGGGCG ACGAGCAGGG CCACTCCGAC GCCCTGCTGG GCGTCTTCGC CGCCATCGCC
CCGGCCGCCG CCCGCGCGCT GGCCGCCCTG GACGAGGGCG ACACCGCCCG GTACCGGGCC
CTGATGGACC CCACGGTCCC GCTGGCCCGG CACCTGTTCA CCGAGCCCAC CTTCTACTAC
AAGACCGGCG TGGCCTTCCT GGCCTGGCTC AACGGCCACC AGAAGGGCTT CCACATGGTG
GGCGGGCTGC ACAGCGCCCG CGACCTGCCC CACCTGGCCC AGGCGGTGCG CCTGGCCGAC
GCCGCCGGGG CCCTGACCGA CCCCGACCTG GCCGCGGCAC GCATGCGGGC GCTCCTCCAG
GTGTCAGGAG TGGACCAGTG A
 
Protein sequence
MSTLLLPAAD GTAAPYTLSG TPVDASPLPP ARSRTAYAAA HVVADPLAAN APGAPARLDW 
EATLAFRHHL WDQGLGVADA MDTAQRGMGL DWAATAELIR RSGAEAASRG AALACGVGTD
QLDLSQADLE SVITAYGEQL DVVQGAGATP ILMASRALAA VASGPDDYAK VYADLLGRAD
QPVILHWLGT AFDPALAGYW GFADPAEAIE PVAALIAEHA AKVDGIKVSL LDASLEVRLR
RLLPEGVRLY TGDDFNYPDL VLGDEQGHSD ALLGVFAAIA PAAARALAAL DEGDTARYRA
LMDPTVPLAR HLFTEPTFYY KTGVAFLAWL NGHQKGFHMV GGLHSARDLP HLAQAVRLAD
AAGALTDPDL AAARMRALLQ VSGVDQ
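
The sequences above are printed in blocks of 10 characters, so they need to be joined before any analysis. The Python sketch below shows one way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and how the page derived its own GC content figure is not stated, so this is only an illustration of the usual definition (G plus C bases divided by total length).

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence by removing spaces and line breaks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here as a short example input.
first_line = "ATGAGCACAC TCCTGCTACC CGCCGCCGAC GGCACCGCCG CCCCCTACAC CCTGAGCGGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

To apply this to the whole gene, pass all lines of the gene sequence as a single string to `clean_sequence` before calling `gc_percent`.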