Gene Ndas_3237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3237 
Symbol 
ID9247094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3869261 
End bp3870847 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content71% 
IMG OID 
Productinner-membrane translocator 
Protein accessionYP_003681149 
Protein GI297562175 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.941278 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAGGA TCCCCCGTCC CGCGAGGACC CTGCCCGCAC CCGCGGCCGC GGGCGTCGCG 
ATCGCCCTGG GCGTGGTGCT GACCATCGCC CTGGACACGG CGCTGCTGTG GTGGGCGGCG
CTGATCATCG GCGTCGTCGC GGCCTTCGCC TTCGTGGCCT GGTCCGGGCC CATGGTGCCG
CGCGAGCGCC GCGCGGGCGC GGACGGCGCC CGCGAGGAGG GCACGGTCCT GCACAACACG
CTGGTGGTGC CCACCGCCCT CGTGCTGTCG GTGGCCGCCG GTCTGCTGAC CGCGTGGTTC
CTGGACCTGT CCCTGATCGA GCCGGTGGCC GCCGCGCTGG GCGCGCTGGC CGGCTCCGCC
GCGGCCTTCT GGCTGTTCCA CCGGCTGTCG GCCATGCTGG CGCTGTCCTT CGCCGCGGTC
CTGGCCGCCG GGGTCATCGC GACCGCGGTC ACCGCGGCCG TGCTGTTCTT CAGCGGCATC
CCGCCGCTGG CCACGTTCGA GCGGATGCTG GAGTACGGCA CCCGGCCCGA CAGCCTGGTG
CGCATCGTCA ACGACGGCAC CACGTACTAC CTGGCCGCGG TCGCGGTGGC GATCGGCTTC
AAGATGAAGC TGTTCAACAT CGGCGTGGAC GGCCAGTACC GGCTCGCCGC GCTGCTGTCG
GCCGCCGTGG GCGGTTACAT CATGCTGCCG CCGGTACTCA GCCAGATCGT CATCGTCATC
ACGGCCGTGG CCGTGGGCGG CGTGTGGGCG GGCATCGCCG GGTACCTCAA GGTGACCCGC
GGCGTGTCCG AGGTGATCTC CACGATCATG CTCAACTCCA TCGCCACCGG TGTCACCGCC
TACCTGCTGA GCACCGACCG CCTCGCGGTG GAGATCAGCA CCAACAACAT CGGCACCCCG
CCGATGCCCG AGTCCTCCTG GGTGCCCGGC ATCCCGGCGG GCTTCCTCGG CTCGGACGAG
ACGATCTTCG GGTTCGTGCT CCTGGCCGTG GCCGTCGGCA TCGGCTACTG GGTGATGCTC
AACCGCACCC GCTTCGGCTT CGAACTGCGG GCCACCGGCC AGTCGCAGAC CGCGGCCGAG
GCCAGCGGCG TCAACGTCAA GAAGATGGTG TTCCTGTCCA TGCTCTTCTC GGGCATGGTC
GCCGGTCTGG TGGGCCTGCC CCAGTTGCTG GGCAGCTCGC ACCACTACGC CCTGGACTTC
CCGACCGGCA TCGGCTTCAT CGGCATCGCC ATCGCGCTGC TGGGTCGCAA CCACCCCGTG
GGCATCGTCT TCGCCGCCCT GTTCTGGGCC TTCCTGAACC AGGCGTCGGG CATCCTGCCC
TTCGACGGCA TCCCGCAGGA GATCGCGGTC ATCTCGCAGG CCACCATCGT CCTCACCGTG
GTCGTGGTCT ACGAGGTCGT GCACCGCTGG GGCCGCCGCT ACCAGCAGCA GCAGATCGGC
AGGCAGCTCG GCACCACCGC CGAGTCGCCC GCCCCGGAGA CGGTGAAGGT GTCCGCGGAC
GAGACGTCCC CGGCCCCCGC GGACGGTTCC ACCGACGCGG GCACCGCCCC CGAACCCGAG
GACAGGCGCG GAGGGGAGGC CAAGTGA
 
Protein sequence
MSRIPRPART LPAPAAAGVA IALGVVLTIA LDTALLWWAA LIIGVVAAFA FVAWSGPMVP 
RERRAGADGA REEGTVLHNT LVVPTALVLS VAAGLLTAWF LDLSLIEPVA AALGALAGSA
AAFWLFHRLS AMLALSFAAV LAAGVIATAV TAAVLFFSGI PPLATFERML EYGTRPDSLV
RIVNDGTTYY LAAVAVAIGF KMKLFNIGVD GQYRLAALLS AAVGGYIMLP PVLSQIVIVI
TAVAVGGVWA GIAGYLKVTR GVSEVISTIM LNSIATGVTA YLLSTDRLAV EISTNNIGTP
PMPESSWVPG IPAGFLGSDE TIFGFVLLAV AVGIGYWVML NRTRFGFELR ATGQSQTAAE
ASGVNVKKMV FLSMLFSGMV AGLVGLPQLL GSSHHYALDF PTGIGFIGIA IALLGRNHPV
GIVFAALFWA FLNQASGILP FDGIPQEIAV ISQATIVLTV VVVYEVVHRW GRRYQQQQIG
RQLGTTAESP APETVKVSAD ETSPAPADGS TDAGTAPEPE DRRGGEAK