Gene Ndas_1461 details

Gene Information

Locus tag: Ndas_1461
Symbol:
ID: 9245311
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1788782
End bp: 1790464
Gene Length: 1683 bp
Protein Length: 560 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: Ricin B lectin
Protein accession: YP_003679398
Protein GI: 297560424
COG category:
COG ID:
TIGRFAM ID:
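
The start and end coordinates, gene length, and protein length in this record are mutually consistent, which a little arithmetic confirms. The Python sketch below assumes the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon; both are assumptions about the record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1788782, 1790464)
print(length_bp, protein_length(length_bp))  # 1683 560
```

Under these assumptions the computed values match the listed gene length (1683 bp) and protein length (560 aa).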


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.851115
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.229288
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCATCA ACCCCCCGCT TCCCCCCGGC GCGCCCCGGA CCGCTCCGCC GTCCTCCCCG 
CCCTCCCGCA GACAGCTCTC GCGCCTGTCG CGCCTGCGGC GGCTCCTGTA CTCCGCCCTG
GCCGTCGTCC TCTGCGCGAG CGGCCTCTCC GCCGCCGCCG TCACCCCCGC CCGGGCCGCC
GACATCGACA CCGGCGCCTA CTACGTCCTG CGCAACCAGC ACAGCGGCCT GGTCGCCGAC
GTCGAGAGCG CCGGAACCCA GGACGGCGCG CGGATCATCC AGTGGGAACG CACCGACCGC
CCCTGGCAGC AGTTCCGCTT CGTCCCCTCC GGCGACGGCT ACTACCGCCT CGTCAACCGC
CACAGCGGTA AGGCCGTCGA CGTCTGGGAG CACTCCACCG CCAACGGCGC CGAGATCCGC
CAGTTCACCG ATCTCGGCAA CGCCAACCAG CAGTGGCGCC CCGTGGACAC CGGCGGCGGC
GTCCAGCTGA TCAACCGCCT CAGCGGCAAG GCCCTGGAGG TCTGGGAGTG GAGCACCACC
CCGGGCGACC GCCTGTCGCA GTACGACTCC CTCGGCGGAG CCAACCAGGT CTGGGACCTG
GTCCGCGTGG ACGACACCGG CGGTGGGGGA GACGGCGACT GCGGCAGCGG CTCCCACCAC
GCCGAAGCGG TGCGGAACGG CTCCACCTGG ACCGCCCGCA ACGGCGGCAG CACCGTCTAC
ACCGGCGGCG ACATGCTCGC CGCCATGCGC GCGGCCGTCG GCAGCCTCGA CTCCGGCCGC
ACCTCCCAGC AGCGCGTGGT GGTGCGCGGA TCCGGTTCCA TGCCCGCCAA CACCTCGCTC
GACCTGCCCA GCCACACCTC ACTGGAGGTC TGCGGCACCA TCCACGTGTC CGGGTCGGTG
GGCGCCGACC ACGCCGCCGT CCGGATCCGC AACGCCCAGA ACGTCTCCGT CCCCCACCTG
TCCGTGACCG GCTCGCCGTA CTTCGGCGTC TTCGTGCGCG GCTCGCAGAA CGTCCACTTC
GGCCAGATCG ACCTGCGCCT GTCCAGCGGC CTGGGCATGC GCATCGACAG CCGGGGCAGC
GACGCCAACC GCACCACGCG CGACATCAGC ATCAACGACG TGTACGTGTC GGGCACCGAC
AACCACGGCG TGGAGACCTA CAGCGTGGAC GGCCTGGACA TCGGCACCGT CACGGCCCGC
GACACCGGCT ACTCGGGCCT GCTGCTCAAC AACACCGTCA ACGCCACGGT GGACCGGGTG
GACGCCGAGG GCGCCGGGAC CGGAACCGGC TACGCGGCCT TCCGCATGGC CAACCGCAAC
GGGCGGATCG GCAGCGACTA CCCGACCAAC ATCCGGGTCG GCGAGGTCCG GGCCCGCGGC
GGCGGCCGGG GGGTCTTCTG CGTCTCCGAG AGCGGCGGCG CGGTCATCGA CCGCGTGGAC
ATCGCCCAGA CCGGCAACAA CGCGGTGCTG GTCGAGAACT GCCACAACGT CACCTTCTCC
GGGGGCACGA TCGCCGGTCC GGGCAGCGTC CGGATCGCGG CCCGCTCGGA GTTCGCCAAC
ACCTCGAACG TCACGTTCCA GAACCTGACG CTGGCCAACA CCTCACTGGT CGAGAACCCG
TGCTCGGTGA ACCTGACCGT CCGCAACGTC ACCTTCCAGA GCAGCAGCGA CCAGACCTGC
TGA
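
The GC content reported for this gene can in principle be recomputed from the sequence above. The following Python sketch assumes GC content is defined as the percentage of G and C bases over all bases; it strips the spaces and line breaks used in the 10-base block layout. The helper name `gc_percent` is hypothetical, and the example runs only on the first two blocks of the sequence rather than the full gene.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence (14 of 20 bases are G or C).
first_blocks = "GTGCCCATCA ACCCCCCGCT"
print(round(gc_percent(first_blocks)))  # 70
```

Passing the complete gene sequence to the same function gives the value to compare against the listed GC content.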
 
Protein sequence
MPINPPLPPG APRTAPPSSP PSRRQLSRLS RLRRLLYSAL AVVLCASGLS AAAVTPARAA 
DIDTGAYYVL RNQHSGLVAD VESAGTQDGA RIIQWERTDR PWQQFRFVPS GDGYYRLVNR
HSGKAVDVWE HSTANGAEIR QFTDLGNANQ QWRPVDTGGG VQLINRLSGK ALEVWEWSTT
PGDRLSQYDS LGGANQVWDL VRVDDTGGGG DGDCGSGSHH AEAVRNGSTW TARNGGSTVY
TGGDMLAAMR AAVGSLDSGR TSQQRVVVRG SGSMPANTSL DLPSHTSLEV CGTIHVSGSV
GADHAAVRIR NAQNVSVPHL SVTGSPYFGV FVRGSQNVHF GQIDLRLSSG LGMRIDSRGS
DANRTTRDIS INDVYVSGTD NHGVETYSVD GLDIGTVTAR DTGYSGLLLN NTVNATVDRV
DAEGAGTGTG YAAFRMANRN GRIGSDYPTN IRVGEVRARG GGRGVFCVSE SGGAVIDRVD
IAQTGNNAVL VENCHNVTFS GGTIAGPGSV RIAARSEFAN TSNVTFQNLT LANTSLVENP
CSVNLTVRNV TFQSSSDQTC
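
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). Table 11 assigns the same amino acids as the standard code but accepts additional start codons, which is why the leading GTG of this gene appears as M in the protein. The Python sketch below illustrates this under stated assumptions: the sequence is taken as the coding strand in frame, the first codon is read as methionine when it is one of table 11's start codons, and translation stops at the first stop codon. The function and constant names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (identical in tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, ignoring whitespace and trailing partial codons."""
    dna = "".join(sequence.split()).upper()
    usable = len(dna) - len(dna) % 3
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First two blocks of the gene sequence; the trailing two bases are ignored.
print(translate_cds("GTGCCCATCA ACCCCCCGCT"))  # MPINPP
```

The output matches the first six residues of the protein sequence listed above.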