Gene Ndas_3143 details

Gene Information

Locus tag: Ndas_3143
Symbol:
ID: 9246999
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3759550
End bp: 3760965
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: protein of unknown function DUF901
Protein accession: YP_003681058
Protein GI: 297562084
COG category:
COG ID:
TIGRFAM ID:
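
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (a common convention for such records, but not stated on this page) and that the final codon is a stop codon that is not counted in the protein length. The function name `span_length` is hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Values taken from the Gene Information fields above.
start_bp, end_bp = 3759550, 3760965

gene_length = span_length(start_bp, end_bp)  # compare with "Gene Length"
codons = gene_length // 3                    # number of complete codons
protein_length = codons - 1                  # assumption: last codon is a stop

print(gene_length, codons, protein_length)
```

Under these assumptions the computed values agree with the listed gene length of 1416 bp and protein length of 471 aa.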


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0279599
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCCC GCCCCGGTGC GGGCGGCGAC TCCGACGGCG GCGATCCGCT CGACGGCGGC 
GTCCACCCGG CGGAGGACGG CGGCCACGGC GCCGGCGATG ACGGCGGTGA CGGCGGCGAA
CGGCTGGCCC GGCCGCTGCC CGAGCAGGTG CGCGCCCGGG TCGTCGAGTA CGGCTCGGAC
GTGCTCGGCG GCATGCGCGC GAGCGACCTG CCCCCGCTGC TGCGCAGGGT CGCCAGGTTC
GAGCCGCGCC GCAGGGCCCG GCTCGCCGGA CCGCAGATCG CCGCCCAGCT GGAGAACGAC
GAAACCTTCC GCGGCATGGT CGCGGCCCGC GTCGACCAGG TGTGGCCCGA GCTGGCCGAG
GGCCTGCGCT CGGGCGTGGT GCCCCCCGCG GCCGACCCCG TGGCCGTGGC CGCCTGCGCC
TACCTGCTGC GGCCCCCGGG GTGGCCCGGC ATCGTCGAGG ACGTCCACCG GGAGCTGGAG
CGCCAGACCA GCGTCAAGGA GGCGGACCAG GCCGCGGAGG CCCTGGACGC CGCCCGCCGC
CAACTGGACG AGACCCGGCA CGACCACCAG GAGGAGCTGG AGCGGCTGCG GTCCCAGATC
AAGGCCCAGC GCACCGAGAT CGCCGAGCTG CGCCGCAAGG TGCACACCGA GCGGCAGCGG
GCCAAGGAGG CCACCGAGCG GGCCTCGCGT GCGCTGACCG AGACGGCCGG ACGCGAGTCG
GAGTCCGCCG CGCGGGTCGG CGCGCTGGAG TCGCAGAACC GGCGGCTGAG ATCGAGGCTG
GCCACGGCCG AGGCCCAGCT GGACAACGCC CGGCGGGCGG TGCGCGCCGG ACGCAACGCC
GACGAGGCGC GGCTGCGCGT GCTGCTGGAC GTACTGGTGG AGGCCTCCCA CGGCCTGCGC
CGCGAACTGG CGCTGCCCAC CGTCCTGGAC AGCCCCGCCG ACCTGGTGGC CGAGACCGAG
CAGCAGCGGC GGGTGTCCCT GGGCGGGCTG CCCGACGACG ACCCCGGCCT TCTGGAGCAC
CTGCTCACCG CGCCCCGGGT GCACCTGCTG GTGGACGGCT ACAACGTCAC CAAGACCGGC
TACGGGACCC TCCCCCTGGC CGACCAGCGC ACCCGGCTGA TGAACTCCCT GGAGGGGCTG
GCCAGCCGGA CCAAGGCCGA GATCACGTGC GTGTTCGACG GCGCGGACGT GGACACCCCG
CCGGTGATGG CGGCGGCGCG CCGGGTGCGG CTGCTGTTCA GCGCGCCCGG GGAGACCGCG
GACGAGCTGA TCGTGCGGCT GGTGCGCGCC GAACCCCCGG GGCGACCGAT CGCGGTGGTC
ACCTCCGACC GCGAGATCGT GACGGCGGTG CGCCGCGCCG GGGCGCGCGC GGTGCCCTCG
ACGATCTTCC TGCGCCGCCT GGAGGCGCAC GGCTGA
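
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be cleaned before any analysis. The sketch below shows one way to strip the layout and compute a GC fraction in Python. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed line is embedded here for brevity, so the result describes those 60 bases and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, copied with its block spacing.
first_line = (
    "ATGAGCGCCC GCCCCGGTGC GGGCGGCGAC TCCGACGGCG GCGATCCGCT CGACGGCGGC"
)

seq = clean_sequence(first_line)
print(len(seq), f"{100 * gc_fraction(seq):.1f}% GC")
```

Passing the full sequence block to `clean_sequence` works the same way, since `str.split()` also discards the line breaks.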
 
Protein sequence
MSARPGAGGD SDGGDPLDGG VHPAEDGGHG AGDDGGDGGE RLARPLPEQV RARVVEYGSD 
VLGGMRASDL PPLLRRVARF EPRRRARLAG PQIAAQLEND ETFRGMVAAR VDQVWPELAE
GLRSGVVPPA ADPVAVAACA YLLRPPGWPG IVEDVHRELE RQTSVKEADQ AAEALDAARR
QLDETRHDHQ EELERLRSQI KAQRTEIAEL RRKVHTERQR AKEATERASR ALTETAGRES
ESAARVGALE SQNRRLRSRL ATAEAQLDNA RRAVRAGRNA DEARLRVLLD VLVEASHGLR
RELALPTVLD SPADLVAETE QQRRVSLGGL PDDDPGLLEH LLTAPRVHLL VDGYNVTKTG
YGTLPLADQR TRLMNSLEGL ASRTKAEITC VFDGADVDTP PVMAAARRVR LLFSAPGETA
DELIVRLVRA EPPGRPIAVV TSDREIVTAV RRAGARAVPS TIFLRRLEAH G
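
The protein sequence can be related back to the gene sequence by codon-by-codon translation. The following is a minimal pure-Python sketch, not the method used by the database. It builds the codon table from the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it allows, which is not modeled here). The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last displayed lines of the gene are embedded.

```python
from itertools import product

# Codons in TCAG order paired with the standard amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last displayed lines of the gene sequence (both are in frame).
gene_first_line = (
    "ATGAGCGCCC GCCCCGGTGC GGGCGGCGAC TCCGACGGCG GCGATCCGCT CGACGGCGGC"
)
gene_last_line = "ACGATCTTCC TGCGCCGCCT GGAGGCGCAC GGCTGA"

print(translate(gene_first_line))
print(translate(gene_last_line))
```

The two translated fragments can be compared with the first 20 residues and the last 11 residues of the protein sequence shown above.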