Gene Ndas_2548 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2548 
Symbol 
ID9246399 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3037502 
End bp3039577 
Gene Length2076 bp 
Protein Length691 aa 
Translation table11 
GC content74% 
IMG OID 
Productprotein of unknown function DUF893 YccS/YhfK 
Protein accessionYP_003680473 
Protein GI297561499 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.678112 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGTGG GTGGTTCCCC CGGCGAAGGA CACGGAGCCG AACGCGTCGA GGTGGCCGAA 
CGACCCCACC CCAGGGTCGA CCTGCGGGCG CTGTTCGGAC TCCAGCCCGG CGCCTGGGCG
TGGACCACCG CGGTCAAGGC CGCCGTGTCC ATGTCGCTGT CCTTCGCCCT GGCCACGTGG
CTGTTCGGCT CGGAGGTCGG CACCCTGGCC GCCCTGGGCT CCATGACCGT CCTGTACGAG
AAGAAGACGC CCTACGCCTA CCGTTCGGCG GCCCTGGCCC TGGTCGGCCT GGGCTTCGTC
GCCAGCGTCA CCCTCGGCTC GCTGGCCTCG GCCCTGGCCT CCTGGGCGCC CGTGTTCTCC
ATCGGCCTGA CCGCCGGGAT CGCCACCTGG CTGTGCGCCG CCTGGCGGGT GGACAGACCG
GGCCCCCTGT TCTTCGTCCT GGTCGGCGCG ATCTCCACCA TCGCCCCCGG CGGTCTGGCC
GACGTCCCCC TGCACGCCCT GGTGGCCGCC CTCGGCGCGG CCATCGGCTG GTCGGTGTCA
ATGTCGGGCG CGCCCGTGCG CGCCCGCCAC CCCGAGTACC GCGCCGTGGC CGGGGCCTAC
CGCCAGCTCG CCTCCCTGCT GCGCGCCGTG GGCACGCCCG ACCTGGACCA CGCCCAGCAC
GAGGCCTCGG TGGCGGTGGC CGAGGCCTGG CGGATCGTCC TGCTGGCCCA GACCCGCGGC
TACCGCACCA GCCCCGAGGC CGCCCGGCTG CGCTCCCTGC TGCGGTGGGT CTCCGACGTG
CACCTGGCCA CCACCCAGGT GTGCATGGCG CGCCCCACGA CCCTGCCCGA GAAGGCCGCC
GACTTCGCCG AGCGCATGGC CCCGGCCGTG GCCGACCCCT CCCTGGGCCC CGACCCCGAC
GAGCTCGACG AACTGCGCCG GGGCCTGCGC CCCCGCTCCC TGGAGGCCCG GCTCTACGGC
CAGCTGGCCC GGGCCGCGCA GGCCGCGCGC CGCTTCTCCC ACGATGACGA CGAGCACGCC
GCGACCCTGC ACGACGAGCG CTATCCGGCG CTGTGGGACG CGCTGCGCTC CAGCCTGGGC
TCGGACTCGC TGGTGCGCCC CACCGCGCTG CGCATGTGGA TCACGGTGAC CGCGGCGGGC
GCGTTCAGCC TCCTGCTGGG CCTGGACCAC TACTACTGGG TGGGCCTGAC CGCGACCGCC
GTCCTCCAGG CGGGCAGCGT GGTCCTGACC ATGAACCGGG CCCTCCAGCG GTCCCTGGGC
ACCCTCGTGG GCGTGTTCAT CGGCGCCGCG CTGATCGCGA CGCACCCGCC GCTGGCCGTG
GTCATCGTGC TCGCCGGACT GTTCCAGGGG CTCACCCAGC TGGTGGTGGG CCGCAACTTC
TTCTACGCCT CGGTCCTGGT GACGCCGATG GCGCTGCTGC TGGCCGGAAC CGCCGCGCCC
GACCCCATCA CCGACCTGGC CGGGTCGCGC ATCATCGACA CCGTCGTGGG CTCGGTGTTC
GGCCTGCTGG GCTCGCTGCT GCTGTGGCGC AGGGCGTCGG CGACCCGCCT GCCCCAGGCC
ATCACGAGCG TGCTGGAGAC CTCGCGCGAG TGCATCATGG CGGTCCTGGA CCAGGACGTG
GAGATCGGCC CCGAACGGCG CTACCGGCTG CGCAGGGACA TGCGCGCGGC CCTGGTCAAC
CTGCGCGGGG TCTACGACAG CGCGATCGGG GACGCGCCGC GCGCGGCGTC CACGCTGCCG
CTGTGGCCGG TGGTGGTGGC CACCCAGCGC ACCGGCTACC TGGCCCTGTC CGCGCTGGCC
CGGGACCGGC CCGAGTCGGC GGGCGTCATC ACCCTGCAGC GGGTGGACCT GGCCTTCGGC
GAACTCATCT CCTCGATGAG GGAGCGCCGT ACGCCCCGCA TGGGCGCGAT CCCCCGGCTG
CCCGCCTACC CCAGGATCAA CATGGAGCTG CGGGCGCTGT CCAACGCGAT GACCAGCGCG
GTCGCCCAGG ACGAGCGCGC GGCGCGCCAG GAGGCCGAGC GGCGGGCCCA GCGCGAGCAC
CGCCGGGCCC AGAAGGATCT GGACGCCGAC CTGTGA
 
Protein sequence
MNVGGSPGEG HGAERVEVAE RPHPRVDLRA LFGLQPGAWA WTTAVKAAVS MSLSFALATW 
LFGSEVGTLA ALGSMTVLYE KKTPYAYRSA ALALVGLGFV ASVTLGSLAS ALASWAPVFS
IGLTAGIATW LCAAWRVDRP GPLFFVLVGA ISTIAPGGLA DVPLHALVAA LGAAIGWSVS
MSGAPVRARH PEYRAVAGAY RQLASLLRAV GTPDLDHAQH EASVAVAEAW RIVLLAQTRG
YRTSPEAARL RSLLRWVSDV HLATTQVCMA RPTTLPEKAA DFAERMAPAV ADPSLGPDPD
ELDELRRGLR PRSLEARLYG QLARAAQAAR RFSHDDDEHA ATLHDERYPA LWDALRSSLG
SDSLVRPTAL RMWITVTAAG AFSLLLGLDH YYWVGLTATA VLQAGSVVLT MNRALQRSLG
TLVGVFIGAA LIATHPPLAV VIVLAGLFQG LTQLVVGRNF FYASVLVTPM ALLLAGTAAP
DPITDLAGSR IIDTVVGSVF GLLGSLLLWR RASATRLPQA ITSVLETSRE CIMAVLDQDV
EIGPERRYRL RRDMRAALVN LRGVYDSAIG DAPRAASTLP LWPVVVATQR TGYLALSALA
RDRPESAGVI TLQRVDLAFG ELISSMRERR TPRMGAIPRL PAYPRINMEL RALSNAMTSA
VAQDERAARQ EAERRAQREH RRAQKDLDAD L