Gene Ndas_0038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0038 
Symbol 
ID9243865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp49274 
End bp50449 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content72% 
IMG OID 
ProductExtracellular ligand-binding receptor 
Protein accessionYP_003677996 
Protein GI297559022 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0144084 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.178822 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAAGA CCCCCCTGAC CGCCGCCTCC TGCGCGGTCG TCCTCGCCGT GACCGCCTGC 
GGAACCCCCG GCGAGGCCGC CGGGGGAGGC GAGGACGCGC CCGTCAGGGT CGGCATCGTC
TACTCCGCCA CCGGCCCCCT GGCCACCTAC GGCGAGCAGT ACCGGCAGGG CTTCGAGGCC
GGACTCGACC ACGCCACCGG CGGGACGATG GAGATCGACG GCCGCCCGAT CGAGGTCGAG
TACATGGACG ACGCAGGCGA CCCCACGAAG GCGGTCACGG CCACCCGCGA CCTCATCGGC
ACCGGCCACG ACATCATCGC CGGGTCCACC GCCTCCGGCA TCGCCGTCCA GGTCGCCCCG
CTCGCCGAGC AGAACGACAT CCTCTTCATC TCCGGATCCG CCGCCACCGA CGCCGTCACC
GGCGTCAACG ACCACACCTT CCGCTCCGGG CGCCAGACCT ACCAGGACAT CCTCACCGCC
GGAACCTTCA TGGACGACCC CGAGGGCGCG GACGTCCTGG TCCTGGCCCA GCAGAACGCC
TTCGGCCAGG ACAACGTCGC CGCCGTCACC GACGTCCTCG GGGCCGAGGG GGCCGACGTC
GACAGCGTCC TGGCCCCGCC GGAGACCACC GACCTCACCC CGTTCGCCGA GCAGGTCAGC
CAGGCCGAAC CCGACCTGGT CTTCGTCGCC TGGGCGGGCG AGACCGCCTC CGCCATGTGG
CGGGCACTGG ACCAGCAGGG CATCCTCGAC TCCACCGAGG TCGTCACCGG ACTGGACATC
AAGCCGTCCT ACCCGGTCTT CGGCGAGGCG GGCGGCCGCA TCTCGTTCCT CTCCCACTAC
TTCGACGGCG CCTCCGACAC CGAACTGGCC CGGACCATGA AGGAGTCCGT CGAGGAGGCG
GGCGGCACCG TGGACCTCTT CACCCCCGAC GGGTTCACCG CCGCGCAGAT GGTCGTGCAC
GCCGCCGGGG CCGGGGACGA AGTCCGGGAG CGCATCGACG CCCTGGAGGG CTGGACCTTC
GACGGCGTCA AGGGTGAGCT CACCATCCGC GCCGAGGACC ACGCCCTCCT CCAGCCCATG
TACCAGGTCG AACTGGTCGG CGAGGGCGAG GACGCCCACC CCGAACTGGT CGCGGAGATC
CCCGCCGCGG ACGTCGACCC CGCGGTCGCG GAGTAG
 
Protein sequence
MRKTPLTAAS CAVVLAVTAC GTPGEAAGGG EDAPVRVGIV YSATGPLATY GEQYRQGFEA 
GLDHATGGTM EIDGRPIEVE YMDDAGDPTK AVTATRDLIG TGHDIIAGST ASGIAVQVAP
LAEQNDILFI SGSAATDAVT GVNDHTFRSG RQTYQDILTA GTFMDDPEGA DVLVLAQQNA
FGQDNVAAVT DVLGAEGADV DSVLAPPETT DLTPFAEQVS QAEPDLVFVA WAGETASAMW
RALDQQGILD STEVVTGLDI KPSYPVFGEA GGRISFLSHY FDGASDTELA RTMKESVEEA
GGTVDLFTPD GFTAAQMVVH AAGAGDEVRE RIDALEGWTF DGVKGELTIR AEDHALLQPM
YQVELVGEGE DAHPELVAEI PAADVDPAVA E