Gene Ndas_3393 details


Gene Information

Locus tag: Ndas_3393
Symbol:
ID: 9247258
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4056331
End bp: 4057746
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: putative secreted protein
Protein accession: YP_003681304
Protein GI: 297562330
COG category:
COG ID:
TIGRFAM ID:
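
The start and end coordinates, gene length, and protein length in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS whose last codon is a stop codon, which is consistent with the values listed here.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(4056331, 4057746)
print(length, protein_length_aa(length))  # 1416 471
```

Both results agree with the gene length (1416 bp) and protein length (471 aa) fields above.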


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.327426
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGCAC CTCCCGCAGC ACCACGGGCC AGGAAGGCGC TGCGGCGCAC CGCGGCCCTC 
GCGTCCGCCG CGGCCGCGTC CCTGCTCCTC GGGCTCCTCG GCCCGGCCGC CGCCCAGGCG
GACGAGGCGC CCGCCGAAGA CATCCACAGC GACAACATCA CCCACGTCGC CCACACGCCC
AAGCCCTCGG CCGTGCGCAA CGTCAACTCC GACCTGGCGT TCAGCGGCGA CTACGCCATC
GGCGGCAACT ACGACGGCTT CGTCATCTAC GACATCTCCG AGCCGGAGGA GCCGCAGGTC
GTCTCCGAGG TGCTGTGCCC GGGCGGACAG GGCGACGTGT CGGTCAGCGG CGACCTGCTC
TACTTCTCGG TGGACTATCC GCGAGCGAGC ACCGAGTGCG GGGCACCCTC CGTCCCGGTG
ACCGACCCGG ACGGCTTCGA GGGGATCCGG ATCTTCGACA TCTCCGACAA GGCCAACCCC
CAGTACGTGT CGGCGGTGCG CACCGACTGC GGCTCGCACA CCAACACCCT GGTGCCGAGC
AAGACCGGTG ACAGCGACCT GATCTACGTG TCGTCGTACT CGCCCTCGGA GCGCTTCCCG
AACTGCCAGC CGCCGCACGA CAAGATCTCC GTCATCGAGG TCCCGCACGA CGCTCCCGAG
GAGGCCGCGG TCGTCAACGA ACCGGTCCTG TTCCCCGAGG GCGGCAACCA CGAGCAGGAC
GGGCTGCTGC TGCCCACCCA GGGCTGCCAC GACATCACCG TCTACGCCGA GCGCGACATC
GCCGCGGGCG CCTGCATGGG CGACGGCGTG CTGATGGACA TCTCCGACCC GGTGAACCCG
GTCGTCACCG AGGTGGTCCA GGACGAGAAC TTCGCGTTCT GGCACTCGGC GACCTTCACC
AACGACGCCC GGACCGTGCT GTTCACCGAC GAGCTCGGCG GAGGCGGCGC CCCGACCTGC
ACCGAGGAGG TCGGCCCCCA GCGCGGCGCC AACGCCATCT ACGCCATCGG CGGCGGCGAC
TCGCCGGAGC TGGAATTCGC CAGCTACTTC AAGATCGACC GCCACCAGGG CGACCAGGTG
TGCGTGGCGC ACAACGGCTC GCTGATCCCG GTGCCCGGCC AGGACTACTT CGTGCAGTCG
TGGTACCAGG GCGGCGTCTC GGTGATCGAC TTCAACGACC CGGGCGCCCC GAGCGAGATC
GGCTTCTTCG ACGTGGACTC CCGCGTCGAG GAGGGTGTGC AGGACAACGA CACCTGGTCG
ACGTACTACT ACAACGGCTA CGTGTACTCG TCCGACATCG AACGCGGCCT GGACGTGCTG
CGGATCGACG ACCCGCGCGT GCGCGCGGCC GAGCGGGTGC GGATGGAGGA GTTCAACCCG
CAGAGTCAGG AGAGCTACCG GCCGGGACGG CGCTGA
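
The GC content field can be recomputed from a sequence laid out in 10-base blocks like the one above. This is a minimal sketch, not the database's own method: `gc_percent` is a hypothetical helper that ignores whitespace and any character other than A, C, G, or T.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence above.
print(gc_percent("ATGCCCGCAC"))  # 70.0
```

Passing the full gene sequence, spaces and line breaks included, should give a value comparable to the rounded GC content listed in the record.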
 
Protein sequence
MPAPPAAPRA RKALRRTAAL ASAAAASLLL GLLGPAAAQA DEAPAEDIHS DNITHVAHTP 
KPSAVRNVNS DLAFSGDYAI GGNYDGFVIY DISEPEEPQV VSEVLCPGGQ GDVSVSGDLL
YFSVDYPRAS TECGAPSVPV TDPDGFEGIR IFDISDKANP QYVSAVRTDC GSHTNTLVPS
KTGDSDLIYV SSYSPSERFP NCQPPHDKIS VIEVPHDAPE EAAVVNEPVL FPEGGNHEQD
GLLLPTQGCH DITVYAERDI AAGACMGDGV LMDISDPVNP VVTEVVQDEN FAFWHSATFT
NDARTVLFTD ELGGGGAPTC TEEVGPQRGA NAIYAIGGGD SPELEFASYF KIDRHQGDQV
CVAHNGSLIP VPGQDYFVQS WYQGGVSVID FNDPGAPSEI GFFDVDSRVE EGVQDNDTWS
TYYYNGYVYS SDIERGLDVL RIDDPRVRAA ERVRMEEFNP QSQESYRPGR R
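
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions: `translate_cds` is a hypothetical helper, and it uses the standard codon assignments, which translation table 11 shares for internal codons. It does not handle the alternative start codons that table 11 allows, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and dropping one trailing stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First row of the gene sequence in this record.
first_row = "ATGCCCGCAC CTCCCGCAGC ACCACGGGCC AGGAAGGCGC TGCGGCGCAC CGCGGCCCTC"
print(translate_cds(first_row))  # MPAPPAAPRARKALRRTAAL
```

The output matches the first 20 residues of the protein sequence above.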