Gene Ndas_4611 details


Gene Information

Locus tag: Ndas_4611
Symbol:
ID: 9248492
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5474405
End bp: 5476129
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: protein of unknown function DUF88
Protein accession: YP_003682503
Protein GI: 297563529
COG category:
COG ID:
TIGRFAM ID:
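
The length fields in this record agree with each other if the start and end positions are read as inclusive coordinates. The short check below is only a sketch: the variable names are illustrative, and it assumes the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 5474405
end_bp = 5476129

# Inclusive coordinates: both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1725 574
```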


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0253473
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.838732
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACCGCT GCGCGTTGTT CGTTGACGCG GGGTACCTTC TCGCGGACGG CGCGATGGCA 
GTGCACGGAA CCCGCAACCG GGACTCCGTC TCCTGGGACT ACACCGGTCT CGTCCAGTTC
CTCAACGAGG TCGCGCGGGA CCGCACCGGC CTGCCGCTCC TGCGCTGCTA CTGGTACGAG
GCGGTCGCCG ACGACCGGCG GACCCAGGAG CAGGACGGCA TCGCCGACAT CCCCGGTATC
AAGTTCCGCG GCGCGCGGAT AAGGCCCGGC CGCCGCGAGG GCGTCGAGAG CTACGTCCAG
CGCGACCTCA CCACGCTGGC CCGTACCGGC GTCCTGTGCG ACGCCGTCCT GGTCAGCGGG
GACGAGGACA TGGCGCCGGT CGTGGCCGAC GTACAGGACA TGGGCGTGCG CGTCACGGTC
GTGCACGTCT CCGTCGAGGG CAACTGGACC ATCTCCCGGG CCCTGCGCCG CGAGTGCGAC
GACCTCATCG AGATCGGCGC GGGCCACCTG CGCCCGCACG TCAACCTGCT CTCCAGCGGC
GGCGCCGCCC AGGAGACCGC TGCCAAGACC ACCACGCCGT TCTCCAACGG CCGCGCCCGC
TCGGCGCCCG AGACCCCCAG GCAGCCCGTC GCGACGGCGC AGCCCGGCGG CAGCCGGGTG
GAGGCGGCCT CCTCGTCCGC CGTGGGCCTG GAGGCGATGT TCGCCGGAAC CGGCGGACAG
CCCGCCCCCC AGGGCAGCGC CATGGACCAG CTGCGCGCCA TGCGCAGGTC CCTCGCCCAG
CAGCGCGGCG GCGGGGACCA CCTGGCCGGG CCGAACACGG ACAACCAGCA GTCCGGCCCG
ATCGACGCCA ACGGGTTCCC CTCCGGCGGC CAGGTCCCGC CGAACGTGGG CGGCAACAGC
GCCTATCCGG GCGCCTCGAT GACCGGGGGG CACCCGGCGC CGGGCGGCCA GGGGGGCTAC
CCCGTCACCG GGGGGCACCA GTCGCTGCGC GGGGCCGACC AGACGGGCTA CGGGCTTCCG
GCGAGACAGG CACCGCCCCC GCAGCAGCAC ATGACGGGGC CGCAGCAGTC CTTCGCCGGC
GGGACCGGGC CACAGCAGTC CTTCGCCGGC GGGACCGGGC CGCAGCGCTC CTTCTCCCAG
GAGGGGCAGT CCCTGCAGCC GGGCCAGCAC GGCCAGCCGG GCCAGCACGG CGGGCAGCCG
GGGCACGGGC AACCGGGGCA CGCGCCACCG CAGAGGAGCG CCCCGCCCCA GGACCCCAGG
TTCGGCCCGG GGGACAACCC CGGCTTCGGG GGTGCGAACG ACGCAAACCC TACCTATGGG
ACCAGTACGG AAACTGACAC CGCCCAACGC GGTGGGGCGG CCCCCTCCTA CGGGGGACCG
CAGCCTACTG GTCCCGAACA AGACAGCCGG TTCGCCGCCG GCGACTACCG GGAACCGCGT
CCCAGTCCTG GATATCCCCA GCAGACGCGT CCCGCCACGC ATACCGTTGA CGAAGCGGTG
CATGTTGCGC GCAAAGAAGG GAACGACTTC GCGGAGTCGA TCGCCCGCGA GGCGCCCGCC
CTGTGGGTCG AGGCCGTCCT CGCCCGCAGG CCCCGGATGC CCTCCGACCT GGAGGCGCGT
CTGCTCCAGG GTTCCTCGCT GCCGATCGAC CACCTGTTGC GTGACGAGGT CCGGGACGGA
CTGCGCCAGG GGTTCTGGCA GGCACTGGAG CGTGCCAGGC CCTGA
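
The gene sequence is listed in blocks of 10 bases, so the spaces and line breaks have to be removed before the sequence can be measured. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool, and the demonstration uses only the first line of the listing.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 if empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as a short demonstration.
first_line = "GTGGACCGCT GCGCGTTGTT CGTTGACGCG GGGTACCTTC TCGCGGACGG CGCGATGGCA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full listing to the same two functions should give values comparable to the Gene Length and GC content fields in the record.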
 
Protein sequence
MDRCALFVDA GYLLADGAMA VHGTRNRDSV SWDYTGLVQF LNEVARDRTG LPLLRCYWYE 
AVADDRRTQE QDGIADIPGI KFRGARIRPG RREGVESYVQ RDLTTLARTG VLCDAVLVSG
DEDMAPVVAD VQDMGVRVTV VHVSVEGNWT ISRALRRECD DLIEIGAGHL RPHVNLLSSG
GAAQETAAKT TTPFSNGRAR SAPETPRQPV ATAQPGGSRV EAASSSAVGL EAMFAGTGGQ
PAPQGSAMDQ LRAMRRSLAQ QRGGGDHLAG PNTDNQQSGP IDANGFPSGG QVPPNVGGNS
AYPGASMTGG HPAPGGQGGY PVTGGHQSLR GADQTGYGLP ARQAPPPQQH MTGPQQSFAG
GTGPQQSFAG GTGPQRSFSQ EGQSLQPGQH GQPGQHGGQP GHGQPGHAPP QRSAPPQDPR
FGPGDNPGFG GANDANPTYG TSTETDTAQR GGAAPSYGGP QPTGPEQDSR FAAGDYREPR
PSPGYPQQTR PATHTVDEAV HVARKEGNDF AESIAREAPA LWVEAVLARR PRMPSDLEAR
LLQGSSLPID HLLRDEVRDG LRQGFWQALE RARP