Gene Ndas_1941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1941 
Symbol 
ID9245791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2364733 
End bp2366115 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content70% 
IMG OID 
Productprotein of unknown function DUF21 
Protein accessionYP_003679874 
Protein GI297560900 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.164047 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0989418 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAGCA CCATCCTCAG TATCGCCTTC GGGATCGTGG TGGTCTTCCT GATCACCGTG 
GCGACGGGTT ACTTCGTCGC CCAGGAGTTC GGTTACATGG CCGTGGACCG CTCGCGACTG
CGGGCGAAGG CGGCGGCCGG GGACGCCGGG GCCCAGAGGG CCCTGGGCAT CACCAACCGG
ACGTCCTTCA TGCTCTCCGG TGCCCAGCTG GGCATCACCG TGACCGCCCT GCTGGTGGGT
TACGTGGCCG AGCCCATGAT CGGCGAGGGC GTCGGTGAGC TGCTCGGCCT CGCCGGAATC
CCCACGGGGA CGGGCGTGGC CATCGGCACC GTCCTGGCCC TGCTCTTCTC CACCGTCGTG
CAGATGGTCT TCGGAGAGCT CTTCCCCAAG AACCTGGCCA TCGCCCGGCC CGAGCCGGTC
GCGCGCTGGC TGGCGCTGTC CACGGGGATC TACCTGAAGA TCTTCGGCCC GGTCATCTGG
CTGTTCGACC AGGCGGCGAT CCTGCTGCTC AAGGCGGTGC GGATCGAACC GGTCGAGGAC
GTCCAGCACG CGGCGACCGC GCGCGACCTG GAGAGCATCA TCGCCGAGTC CAAGGCCAGC
GGCGACCTGC CGCCGGAGCT GTCCACCCTG CTGGACCGCA CCCTGGACTT CCACGAGCGC
ACCGCCGGAC ACGCGATGAT CCCGCGTCCG GAGGTGGCCA CGGTGGAGGA GGGCGACCCG
GTCAGCCGGG TCGTGGAGCT GATGGCCTCC GACCACTCGC GCTTCCCGGT GCTGGGTGAC
GGCGTGGACG ACATCGTCGG CGTCATCTGC CTGCGCGACG TGCTCGCGCT GGGCGACCGG
GACCTGGCCA ACACCAAGGT CAGCGAGGTG GCGCGGCCCA CCGTGATGCT TCCCGCGTCG
CTGCCGCTGC CCTCCGCGCT GAGCCAGCTG CGCGAGGCGG GCGAGGAGTT CGCCTGCGTG
GTGGACGAGT ACGGCGGCCT GGCCGGGGTC ATCACGACCG AGGACCTGGC CGAGGAGCTG
GTCGGCGAGA TCGCCGACGA GCACAGCCCC GCGGAGGAGT CCCCCTCCTA CCTGGAGGGC
GAGGGCTCCT ACCTGGTCCC GGGCGCTCTG CACATCGACG AGGTCGAGCG GCTGCTCGGC
CACGACCTGC CCGAGGGGGA CTACGAGACC CTGGGCGGTC TGGTGGTGCA CGAGCTGCAC
CGGCTCCCCG AGGCCGGTGA CAGGGTCTCC ATCGCCCTGC CCCGACCGCC CAGCGCGCAC
GACGAGGACC CCGACATGGG GCTGACCATG GTCGTCAGCG CGGTGCAGAG GCACGTTCCG
CACACCGTGG AGCTGCGTCT GCACGAGGTG CAGGGCAACG AGTCGCAGGA GGCGTCGGCA
TGA
 
Protein sequence
MMSTILSIAF GIVVVFLITV ATGYFVAQEF GYMAVDRSRL RAKAAAGDAG AQRALGITNR 
TSFMLSGAQL GITVTALLVG YVAEPMIGEG VGELLGLAGI PTGTGVAIGT VLALLFSTVV
QMVFGELFPK NLAIARPEPV ARWLALSTGI YLKIFGPVIW LFDQAAILLL KAVRIEPVED
VQHAATARDL ESIIAESKAS GDLPPELSTL LDRTLDFHER TAGHAMIPRP EVATVEEGDP
VSRVVELMAS DHSRFPVLGD GVDDIVGVIC LRDVLALGDR DLANTKVSEV ARPTVMLPAS
LPLPSALSQL REAGEEFACV VDEYGGLAGV ITTEDLAEEL VGEIADEHSP AEESPSYLEG
EGSYLVPGAL HIDEVERLLG HDLPEGDYET LGGLVVHELH RLPEAGDRVS IALPRPPSAH
DEDPDMGLTM VVSAVQRHVP HTVELRLHEV QGNESQEASA