Gene Ndas_2446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2446 
Symbol 
ID9246296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2899680 
End bp2900744 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content72% 
IMG OID 
Productchitin-binding domain 3 protein 
Protein accessionYP_003680372 
Protein GI297561398 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGTCAC GCAGCACCCT CGCCATGGCC GTGGCGCTCG GCGGCCTCGC CGTCGGCACC 
ACCGTCGTCG CGCTGCCCGA CACGGCCCAG GCCCACGGCG CGTTCACCTA CCCGGCCAGC
CGCACCTACG CCTGCTTCCA GGACGCCACC GGCGGCAGCA GCGGCGGCGC GCTCGCGCCG
ACCAACGACG CCTGCGCCGA CGCGCTGGCC GAGGGCGGCA ACTACCCGTT CTGGAACTGG
TTCGGCAACC TCATCAGCAA CTCCGACGGG CGGCACCAGG AGTTCGTGGC CGACGGCGAG
CTGTGCGGAC CCACCGACAG CTTCTCCGCC TTCAACGCGG TCCGCGCCGA CTGGCCGACC
ACCGAGCTGC CCGCCGACAC GACGGTGGAG TTCCACCACA ACGCCTGGGC CGCGCACCCG
GGCACCTTCT ACACCTACGT CACCGAGGAC GGGTTCGACC CCGCGACCGA CGCGCTGACC
TGGGACTCCC TGGAGCTCAT CGACGAGGTC ACCGACCCGC CGCTGCGCAG CGGCGGCGTC
GCCGGAGCCG AGTACTACTG GGACGTGGAC CTGCCCGACA AGGAGGGCCA GCACGTCATC
TACGTGGTGT GGGAGCGCTC CGACAGCCCC GAGGCGTTCT ACAACTGCTC CGACGTGGTC
TTCGAAGGGG GCTCGGGCGG CAACCCGGAC CCCGAGCCGG AGCCGGAGCC GGAACCCGAG
CCGGAGCCGG AGCCGGAGCC GGAGCCCACG GACCCGCCGG CGGGTGAGTA CTGCACGGCC
GAGTACACGG TCCTCAACGA GTGGCAGGGC GGCTTCCAGG CCGAGGTCGA GGTCACCGCG
GGCGAGAACG GGACCGACGG CTGGATGGTG GACTGGGTGT TCGCCAACGG CCAGGCGGTC
AGCAGCGCCT GGAACGCCTC GCTCGTCAAC CACGGCGCCC ACTTCGAGGC CAGGAACGCC
GCGCACAACG GCTCGCTCGC GGCCGGGGAG AGCGCCAGCT TCGGGTTCAC CGCCACCTCG
GGGAACGTGA ACATCGAGCC GCGGGTGACC TGCCAGGAGC CGTGA
 
Protein sequence
MRSRSTLAMA VALGGLAVGT TVVALPDTAQ AHGAFTYPAS RTYACFQDAT GGSSGGALAP 
TNDACADALA EGGNYPFWNW FGNLISNSDG RHQEFVADGE LCGPTDSFSA FNAVRADWPT
TELPADTTVE FHHNAWAAHP GTFYTYVTED GFDPATDALT WDSLELIDEV TDPPLRSGGV
AGAEYYWDVD LPDKEGQHVI YVVWERSDSP EAFYNCSDVV FEGGSGGNPD PEPEPEPEPE
PEPEPEPEPT DPPAGEYCTA EYTVLNEWQG GFQAEVEVTA GENGTDGWMV DWVFANGQAV
SSAWNASLVN HGAHFEARNA AHNGSLAAGE SASFGFTATS GNVNIEPRVT CQEP