Gene Ndas_2503 details

Gene Information

Locus tag: Ndas_2503
Symbol:
ID: 9246353
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2966655
End bp: 2968265
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: protein of unknown function DUF112 transmembrane
Protein accession: YP_003680428
Protein GI: 297561454
COG category:
COG ID:
TIGRFAM ID:
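
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic in Python. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function name feature_lengths is hypothetical and not part of any database API.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not counted as an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


gene_len, protein_len = feature_lengths(2966655, 2968265)
print(gene_len, protein_len)  # 1611 536
```

Under these assumptions the result matches the Gene Length and Protein Length fields of this record.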


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.835777
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTCCC TGACACCCAT GATCAACGGG TTCGGTGTCG TCCTCGAACC GGTCAACCTG 
CTCTACTGCC TGATCGGCGT CGTGGTCGGC ATGCTGGTCG GGGTCCTGCC CGGGCTCGGG
CCCGCGGCCA CGATCGCGAT CCTGCTCCCC CTGACCTTCG GCCTCGAACC GGTGACCGCG
ATCATCATGC TCGCGGGCAT CTTCTACGGC ACCCAGTACG GGGGGACGAT CACCTCGGTC
CTGCTGCGCC TGCCCGGCGA GGCGTCCTCG GTGGTGACGG TCTTCGACGG CCACATGCTG
GCCCGCCAGG GCCGGGCCGG GACGGCGCTG GGCATCGCGG CCGTGGGCTC GTTCGTGGGC
GGGACCGTGT CGATCGTGGC CCTGTCCCTG GTCGCGCCCC TGGTGGCGAG CTTCGCCCTG
GACTTCGGCC CGCCCGAGTA CACCGCGCTG GCGCTGCTGG GCATCCTGCT GGTGTCCACC
GTCGGCAACG GCAGCCGGAT CAAGGCGGTC ATCGCCGCCG GCGTGGGCCT GCTGCTGGCC
ACGGTCGGGC TGGACACCTT CACCGGCGCC GAACGCTTCA CCTTCGACTC CATGGCGCTG
TCCGACGGGA TCGACTTCGT GCCGATCGCG ATGGGCCTGT TCGGCATCGG GGAGATCCTG
CACAGCCTGG AGGAACGCCA CCGGGCGCCG AAGAAGCCCC TCAAGGTCAC CAACACCTGG
CCCTCGCGCA AGGACCTGCG CCAGTCGTCG GGCGCGATCG GGCGGGGTTC GCTCATCGGC
TTCGCGCTGG GCATCCTGCC CGGCGGAGGC GCCACCCTGT CCTCCCTGGC GGCCTACGCG
ATGGAGAAGC GGCGCTCACG CGACCCCGAG CGCTTCGGCA GGGGCGCGGT GGAGGGCGTC
GCCGCTCCCG AGACGGCCAA CAACGCCGCC GCCACCTCCT CGTTCATCCC GCTGCTGACC
CTGGGCATCC CGGCGAACGC GACGATGGCG ATCATCTTCG GCGCGCTGCT CATCCAGGGT
GTGCCGCCGG GACCGGAGCT GGTGACCCAG GAGCCGGAGC TGTTCTGGGG CGTCATCAAC
TCGATGTACA TCGGCAACAT CCTGCTGCTG ATCATGAGCA TTCCGCTGGT GGGGCTGTTC
GTGCGGATCC TGCGGGTGCG CCCGACGATC CTGGCGCCCA TCACGGTGCT GATCACGCTG
GTGGGCGTGT ACACGGTGCG CAACAACGTG TTCGACATCG TGCTGGTGGT GGTCTTCGGA
CTGTTGGGCT ACCTGATGAA GAAGTTCGGC TTCGACCCGG GGCCGCTGGT GCTGGCCTTC
GTCCTGGGTT CGCTGCTGGA GAGCTCGCTG CGCCGGTCAC TGCTGCTCTT CGACGGGGAC
CCCACGGGGT TCCTGACCCG GCCGATCTCG GGAACGCTGT TGCTGCTGTT GGCGGTGGTG
ATCGTGCTGC CGCTGACGCG CGCTCTGTGG CGGTGGTACC GGGGCCGGGT CGATGGCAGT
AGCAGTGGCA GTGGCAGTGC CAGTGGTGGT GCCAGTGGCA GTGGTGACGG CGGCGGTAGT
GGCGCTGGTG AGGGTAGGAG CGAGGAACCC GCAGGGAGGA CGGACGCCTG A
 
Protein sequence
MDSLTPMING FGVVLEPVNL LYCLIGVVVG MLVGVLPGLG PAATIAILLP LTFGLEPVTA 
IIMLAGIFYG TQYGGTITSV LLRLPGEASS VVTVFDGHML ARQGRAGTAL GIAAVGSFVG
GTVSIVALSL VAPLVASFAL DFGPPEYTAL ALLGILLVST VGNGSRIKAV IAAGVGLLLA
TVGLDTFTGA ERFTFDSMAL SDGIDFVPIA MGLFGIGEIL HSLEERHRAP KKPLKVTNTW
PSRKDLRQSS GAIGRGSLIG FALGILPGGG ATLSSLAAYA MEKRRSRDPE RFGRGAVEGV
AAPETANNAA ATSSFIPLLT LGIPANATMA IIFGALLIQG VPPGPELVTQ EPELFWGVIN
SMYIGNILLL IMSIPLVGLF VRILRVRPTI LAPITVLITL VGVYTVRNNV FDIVLVVVFG
LLGYLMKKFG FDPGPLVLAF VLGSLLESSL RRSLLLFDGD PTGFLTRPIS GTLLLLLAVV
IVLPLTRALW RWYRGRVDGS SSGSGSASGG ASGSGDGGGS GAGEGRSEEP AGRTDA
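
The sequences above are printed in blocks of ten characters separated by spaces and line breaks. A minimal Python sketch for turning such text into one contiguous string and computing its GC fraction is given below. The helper names clean_sequence and gc_fraction are hypothetical, and the example uses only the first line of the gene sequence, so the printed GC value is not the whole-gene figure.

```python
def clean_sequence(block_text):
    """Remove all whitespace from block-formatted sequence text."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGGATTCCC TGACACCCAT GATCAACGGG TTCGGTGTCG TCCTCGAACC GGTCAACCTG"
seq = clean_sequence(first_line)
print(len(seq), seq[:3])  # 60 ATG
print(round(gc_fraction(seq), 2))
```

Applying clean_sequence to the full gene sequence should give a string whose length can be compared with the Gene Length field, and gc_fraction on that string can be compared with the GC content field.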