Gene Ndas_1230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1230 
Symbol 
ID9245080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1528467 
End bp1531841 
Gene Length3375 bp 
Protein Length1124 aa 
Translation table11 
GC content76% 
IMG OID 
ProductOmpA/MotB domain protein 
Protein accessionYP_003679175 
Protein GI297560201 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.172002 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00819992 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGACCCAGA TCCGACGGCT GAGCGCCCTC CTGCTCGCAG CGGCGATCCT GCTCGGCATC 
CCCTACCTGC TCCTGTGGCA TCTGCCCTGG CCGGAGCTGC CGGACTCCTG GGCCGTGGCC
GCGGCCCACC TGCGCGGCTG GCGTCTGCCG CCGGGTGTGT TCACCGCCCT GCTCATCACC
GTGCTGTGGG CCCTGTGGGG TCTGTACGCG GCCGGACTGG TCGTGGAGAC CTTCGCCCGC
CTGCGCGGCA TGCCCCGCAG GCTGCGCCCC CTCGGGCCGC TCCAGGTGGT GGCGGCGGCC
GCCGTGGGAA CCGTGGCGGT CGCCCCGGCC ACCGCGTTCG CCGACACCGT CACCACGCAG
GAGGACCGCG AGGCGGACGG GCAGCAGGAC GCCGACGCGC CCCCCGCCCC CGAGGAGGCC
CCGTCCGCGC AGCCCGTCGA GCGGGTGCGC ACCGTCTCCG GTTTCGGCGT CGGCTCGGCC
GAGCTCACCG AGCAGATGCG CGACGACCTC GCACCGGTCG CGGAGATGAT CGACTCCTAC
GGCGACACCG AGACGCCCGT GCGCATCACC GGCCACACCG ACCCCAGCGG GAACGCCCAG
ACCAACCAGG AGCTGTCCGA GCGCCGGGCC CAGGCGGTCG CCGACCACCT CGACGAGGTC
CTGGGCGAGG AGGCCCCGGA GATGGAGGTG GAGGGCGTCG GCTCCGAGCA GCCCCGCGAG
GGCGGCGCCG CCGCCCAGCG CAGGGCCGAG GTCGCCTACA CCGTCGTTCC GCAGCCGACC
TCCCTCCAGT CGCGTGTGAT GGCCGCCGAG ACGCCGGACC CCGGTGCCGG TGCCGGGGAG
ACCGCTGCCC CCGTCGAGGA GGACGACCCC GTGGAGGCGG CGGCCGCTGC CGCCGCCGAG
GCCGCCGGGG ACGGCGAGAA CGTGCTGGTC GTGGAGATCC CCGACGGCGC CGTCACCGGA
GCGGTGGCGT TCGCGGGCCT GGCCGGGGGA TACCTGCTGG GCAAGCGGGG CGTCCGGATG
CCCGGCGTCA GCCTGAGCCT GCCCCGCCTG CCGAGGGCGC CCCGCCGCCT GGCCCTGACC
GCCCCGCCGC CCCGCCCCAC CCCCGGGGAC GAGATCGACG AGCGGGTGAC GGTGGAACTC
GACCACGTGC CCGGCCTCGG CATCACCGGG CCGGGTGCGG CGGGGGCCGC CCGGCGCCTC
ATCGTCAACG CGCTCGACCG CCTCAACGGC GACGCCGTGC GCGTGCTGCT CACCGAGGCC
GAGGCCGCCC GCCTGGTGGG CGACCGGGGC CGCGACCTGC TGCGCGCGCA TCCCTGCGAG
CCCGTGCGGA TGGTCGGCAC GATGGAGGAG GCCCTGACCG TCCTCCAGCG CGAGCTGCAC
CAGATCGTCG AGGAACCGGG GGAGCGGCCG CACACGCCCC TGGCCCTGGT GACCTCGCCC
ACCTCCGAGC ACGAGACCGC GCTGTCGGGT CTGCTGCTGC ACGGGCAGCA GCAGGGCATC
ACGGCCGTGG TCCTGGGCCG CTGGCCGCTC GGGGGCAGCT GCGTCATCGA GGAGGACGGT
CTCATCACCG AGACCAGCCC TCCGCTCAAC CCGGTCTTCC ACTGCTCCTG GCCCGGCGCG
ACCGCCGAGG AGGTCATGGA CGCCGTCCGC GCCTACCGCC ACTCCTCGCC CGCCCTGGAG
GAGACCGGTG ACCGGGTGCC GAGCCCGGAG GCCGCCGCGG AGACCGTCCC GGACGTCGCG
CTGGAGGAGG CCGGTTCCTT CGCGGAGCCG TTCCTCGGCG CCCTGGGGGA GCAGACCGCT
GTTGAGACGG CGGAGGCGAA CGAGCCGGTA GAGACGGCTG AGTCGACCGA GTCGACCGAG
TCGACCGAGT CAGTCGATGC GGTTGATGCC GCTGTGGGCG AGAAGGACCA GAAGAGGGCC
GAGGCGGCCG CGAAGCCCAG GAGGACCAAG GGGATCAGGA AGGCCAAGGC GGCCGACGCC
TCGGAGGAGG GCTTCTGGGA CGCCGACGTC TGGAACACCG ACTTCTGGGA GACCGACGCG
CCCGCGGCCG GGAAGGCCGA CACCGACCGG TCCGAGACCG AGTCGGCCGA GGCCGCGCCG
TCCGTGCCCG AGCGGCCAGG TGCACTGTGG TCCGAGGCCG AGCCGGACGA CACGGCCCGC
TTCGGGGCCG CGTCGGAGGT CTCCACGCGG CCGGACACCC GGTCGGACGG TTCCGTCATC
GGGCGGCCCG CCGTGGAGGA GAGCACGACG CGGCCTTCCT CGTCCGCACC CTCCGTTTCC
GAGGAGGCGG TTTCCGGATC GTCCGCGGAC GCGGAACCCG CGGGCACCGG AGCGGACCGC
TCCGCCCCCG AGCCGTCCGC CGCCGAGGAG GCCGAGCCCC GGCCGTCCGC GTTCGAACAG
CTCGCCGCCG AGGAACAGGT GTCCCGGGAG ACCGTGACGG GACCGTCCGA GTCCGCGCGG
CCCACCGCCT CCGCACCCGC TCCGGAGAAG GCCGCCGCCG AGGAGTCCGC GCCCGAGCAG
AGTGCGTCGG AGCGGTCCGC CCGGGAGGAG GGCGCCCCCG TGGCCTCCCC GTCCGAACGG
CCCTCCGCCG AACAGTACGC CGCCGAGCTG TCCGCGCCCG CGCCGTCCGC GACCCGGGAA
CAGCGCGCGC CCGAGGAGAC CGCGTCCGAG CCCGCCGAGC CCCGGGAGAA CGCGTCCGCC
CAGGTCACCG CTCCGGCACG GACGGAACAG CCGAAGGAGA ACGCACCCGC GGAGCCCCCA
GCCCAGCGCC CCACGTCCGA GGAGGCCGTC CCCGCCAGGC CCGCCGCCGA GCGGCGTGAG
CCGGAGCCCG AGCCCGCCAG GCCCGCCCCC GGGCGTCCGA GCACCGGCAC GGTTCCGTCG
GTGGTCGCCT CTCCAGCCCG CGCCCAACGC CCGTCCCAGC AGACGCGCAC GGCGCCCGCC
CAGCAGGAGA CGGCCTCCGA GGAAGAGACC GCCCCCGTCC GCAAGGCCGC CAAACGCGCC
GTGCGCTCCC GCCTCCCGAA GCCGAGCGGG ACCGCGTCCG CTCCGGCGGC CCAGGCGTCC
CCGTCCGCCC CGGCGGCGCG GTCCGAGCAG CTCCCGCTCC CGGACCAGCC CGCCCGCCCC
GAGCGGGCCC GCGCCCGCGC CCAGAACGCG GCCGCGTCCG GCGCCGAGCC CCTCACCAGC
CGCGCGGAGC GCATGGAGAG GCGCACCAGG GCCGAACGCA GGAACGCCAC CGCCCCCGCC
GGAGCCGTCG CCGAGCACCC CGTGCCGGAC ACCGAGGGCG GGGAGTCGGA CGGGTCCCAG
CCGTCCCGGC CCCTGCTCCG CAAGCCCAAG AAGGCGGGAC GCGGAAGGAG CTGGCGCCCC
AGGGACAACT CCTGA
 
Protein sequence
MTQIRRLSAL LLAAAILLGI PYLLLWHLPW PELPDSWAVA AAHLRGWRLP PGVFTALLIT 
VLWALWGLYA AGLVVETFAR LRGMPRRLRP LGPLQVVAAA AVGTVAVAPA TAFADTVTTQ
EDREADGQQD ADAPPAPEEA PSAQPVERVR TVSGFGVGSA ELTEQMRDDL APVAEMIDSY
GDTETPVRIT GHTDPSGNAQ TNQELSERRA QAVADHLDEV LGEEAPEMEV EGVGSEQPRE
GGAAAQRRAE VAYTVVPQPT SLQSRVMAAE TPDPGAGAGE TAAPVEEDDP VEAAAAAAAE
AAGDGENVLV VEIPDGAVTG AVAFAGLAGG YLLGKRGVRM PGVSLSLPRL PRAPRRLALT
APPPRPTPGD EIDERVTVEL DHVPGLGITG PGAAGAARRL IVNALDRLNG DAVRVLLTEA
EAARLVGDRG RDLLRAHPCE PVRMVGTMEE ALTVLQRELH QIVEEPGERP HTPLALVTSP
TSEHETALSG LLLHGQQQGI TAVVLGRWPL GGSCVIEEDG LITETSPPLN PVFHCSWPGA
TAEEVMDAVR AYRHSSPALE ETGDRVPSPE AAAETVPDVA LEEAGSFAEP FLGALGEQTA
VETAEANEPV ETAESTESTE STESVDAVDA AVGEKDQKRA EAAAKPRRTK GIRKAKAADA
SEEGFWDADV WNTDFWETDA PAAGKADTDR SETESAEAAP SVPERPGALW SEAEPDDTAR
FGAASEVSTR PDTRSDGSVI GRPAVEESTT RPSSSAPSVS EEAVSGSSAD AEPAGTGADR
SAPEPSAAEE AEPRPSAFEQ LAAEEQVSRE TVTGPSESAR PTASAPAPEK AAAEESAPEQ
SASERSAREE GAPVASPSER PSAEQYAAEL SAPAPSATRE QRAPEETASE PAEPRENASA
QVTAPARTEQ PKENAPAEPP AQRPTSEEAV PARPAAERRE PEPEPARPAP GRPSTGTVPS
VVASPARAQR PSQQTRTAPA QQETASEEET APVRKAAKRA VRSRLPKPSG TASAPAAQAS
PSAPAARSEQ LPLPDQPARP ERARARAQNA AASGAEPLTS RAERMERRTR AERRNATAPA
GAVAEHPVPD TEGGESDGSQ PSRPLLRKPK KAGRGRSWRP RDNS