Gene Ndas_1223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1223 
Symbol 
ID9245073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1523929 
End bp1525287 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content74% 
IMG OID 
Producttype II secretion system protein E 
Protein accessionYP_003679168 
Protein GI297560194 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00198137 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGCAGCG GACACGACGC CGGATACGAC ATCACCGACG GCTACTTCCT GGACACCGAC 
CGGGCCCGCA CCGTGGCCGA CCGCGTGGAG CGGACCGTCA CCGAGGCCAC CCGGCGGCTC
AGCGAGGAGA CCCGCGGCCA ACCCGAGGAC ACGCCCGAGG AGTACCGCGC CCGCGCCGAA
CGCGTCATCG CCCAGATCCT CGACGAGGAC GCGAGCCGCG CCCTGTCCGA GGGCAGGCAG
GTGCTCGACG CCACTACCGA GGCCTCCGTC GCCGCTGGCG CCCTGGCCCG GGTGTGCGGG
CTCGGACCCC TCCAGCCGCT CCTGGACGAT CCCGGGATCG AGAACATCAA CATCAACGGC
GTGCACGTGT GGGTGCGCCG CGCCGACGGA AGCCGCGAGC GGCACGACTC CCTCTTCGAC
GACCCCGACG AGGTCGTCGC CCTGGTCCGG CGCCTGGCGT CGGAGTCCGC CACGGGGGAG
CGCCGCTTCG ACCCGGGCGC GCCCATCCTC GACATGCAGC TGCCCGGCGG CGAGCGCCTC
AACGCGGTCA TGGAGGTCGC CCGGCAGCCC TCGGTGTCCA TCCGCCGCCA CCGCTACGGC
CGCACCACCC TCGAACAGCT CCACGGACTG GGCACGATCG ACGACACCCT GGTCGCCCTG
CTGCGCGCGG GGGTGCGCTC CCGCCGCAAC ATGGTCATCA CCGGCGGCAC CGGCGCGGGC
AAGACCACCC TGCTGCGGGC CCTGGCCGCC GAGATCCCCG TGGACGAACG CCTCGTCACC
ATCGAGGACG TCTTCGAACT CGGGCTGGAC CGCGACCGCG ACGCCCACCC CGACTGCGTC
GCCCTCCAGG CCCGACCGGC CAACGTCGAG GGCGTCGGCG AGATCACCAT CGCCGACCTG
GTGCGCACGG CCCTGCGCAT GTCGCCGGAC CGGGTCATCG TCGGTGAGAC GCGCGGCCAC
GAGACCGTCC CGCTGCTCAA CGCCATGAGC CAGGGCAACG ACGGCAGCCT CACCACCCTG
CACGCCGCCA ACTCCGCGGG CGCCTTCACC AAGCTGGGCG CCTACGCCGC CCAGTCCGCC
GAGCGCCTGC CCCTGGACGC CACCGCGTCC CTGGTGGCCG CCGCCGTGCA CCTGGTCGTG
CACGTGTCGG CGCTGCCCAC GGGCGGGCGC ATGGTCACGA GCGTGCGCGA GGTGGTGGGG
GCCGAGGGGC AGAACGTGGT CTCCAACGAG ATCTACCGGC GCGACCGCAA CGGGCCCCTG
CCCGCGGCGC CGCCCAGCCC CAACACGCTC GACGCCCTGG CCGAGGCCGG GTTCGACCCG
GCGATGCTGA GCCCGGACAC AGTGGGGTGG GCACGGTGA
 
Protein sequence
MSSGHDAGYD ITDGYFLDTD RARTVADRVE RTVTEATRRL SEETRGQPED TPEEYRARAE 
RVIAQILDED ASRALSEGRQ VLDATTEASV AAGALARVCG LGPLQPLLDD PGIENINING
VHVWVRRADG SRERHDSLFD DPDEVVALVR RLASESATGE RRFDPGAPIL DMQLPGGERL
NAVMEVARQP SVSIRRHRYG RTTLEQLHGL GTIDDTLVAL LRAGVRSRRN MVITGGTGAG
KTTLLRALAA EIPVDERLVT IEDVFELGLD RDRDAHPDCV ALQARPANVE GVGEITIADL
VRTALRMSPD RVIVGETRGH ETVPLLNAMS QGNDGSLTTL HAANSAGAFT KLGAYAAQSA
ERLPLDATAS LVAAAVHLVV HVSALPTGGR MVTSVREVVG AEGQNVVSNE IYRRDRNGPL
PAAPPSPNTL DALAEAGFDP AMLSPDTVGW AR