Gene Ndas_1817 details


Gene Information

Locus tag: Ndas_1817
Symbol:
ID: 9245667
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2222603
End bp: 2224249
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: Spore coat protein CotH
Protein accession: YP_003679751
Protein GI: 297560777
COG category:
COG ID:
TIGRFAM ID:
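
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; neither convention is stated in the record, and the function names are purely illustrative.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2222603, 2224249)
print(length)                     # 1647
print(protein_length_aa(length))  # 548
```

Under these assumptions the computed values agree with the listed gene length (1647 bp) and protein length (548 aa).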


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.572718
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGGAC TCGATGATGA GGAGGGCGGC GGCGACACCG CTGCCCGCGA CACCGCCGCC 
GGGCGGCGCC AGCGCGGGCC CGCGCGCCTG CGGCACCGCC TCCCGGTGCG CCTGCGCCAG
CACTGGAGGA GCACCGCCGT CGTGTGCGCC GCGCTGCTCG CCCTGGTCCT GGTCTTCGGC
GAGGCGCGGG TGCGCCCCTA CGTCACCTCC GAGTTGGTCT CGCGGGACGC CGTCACCCAG
AACATCGAGG GCGAGGGCGA CCTGTTCGAC GGCGGGGAGC ACACGATCGA GGTGAGCTTC
GACCAGACCG AGTACGCCGA CATGATGAGC ACCTTCCGCG AGGAGGGCGA GAAGGAGTAC
ATCCGGGCCG ACGTCACCAT CGACGGCACC ACCGTCGAGG ACGTCGGACT CCGCCTCAAG
GGCAACTCGA CCCTGATGAG CCTGCGCGGC GACTCCGGGG CAGGGGCGGC GCCCGGCCAA
GGGGGCCAGC AGCGGGAGGA GACCGGTACG GACACCGGCA CCGGGGACGC CGGCGGGGTC
GCTGACGGGA CGACCGGCGC GGCGGCCGAC ACCAGGGGTG AACAGGGGAC CGGCCCGGGC
GGCGGGGGAG CGGCCGGGCC GGGCTCGGTC ACCCTGTCCG AGGACGAGCC GGAGAACCTC
CCCTGGTTGA TCAGCTTCGA GGAGTTCGTC TCCGGCCGGG CCTACCAGGG GCACACCGAG
ATCGCCCTGC GCCCGGCGAC GGCGACCTCC GACACCGCGC TCAACGAGGC CCTGGCCCTG
GAGCTGACGG CGGCGGCCGG GCAGACCACG CAGGACTACA CGTTCACCTC GTTCGCGGTC
AACGGCGGTG AGGGGGTGCC GCGCCTGCTG CTGGACGCGC CCGACGCCGC CTGGGCCGAC
GGGTACGGGG ACGGCGTGCT CTACAAGGGC CGCGCCGGGG GCTCCTTCGA CTACCTGGGC
GGGGACGCCA CCGACTACGA GGAGGCCTTC AACCAGATCA ACGGCGAGGG CGCCCACGAC
CTCCAGCCGG TGATGGACCT GCTGCGGTTC GTGGCCGAGT CCGACGACGA GGAGTTCGCC
CGCGAGCTGG ACGAGCACCT GGACACCGAG TCCTTCGCCC GCTACCTCGC GCTGCAGAGC
CTGATGTCCA ACAGCGACGG CATGGACGGC CCGGGCAACA ACTACTACCT GTGGTACGAC
ACCGCCCAGG AGCGGTTCAC CGTCCTGTCC TGGGATCTGA ACCTGTCCTT CGGCGGTATG
GGTGGCATGG GCGGCGGTGG GGGAGCGATG CCGGAGGGGG CGCAGTTCCC CGGCGGGGGC
ACACCCCCCG GTGGGACGTG GCCGCCCGAG GGCATGGAGG CGCCCGGGGG AATGCCCGCG
CCAGGAGAAG GGGAGGCGGT CGGGGGCGGC ATGTCCATGG GCGGCAGTGG GGCGCTCAAG
GAGCGCTTCC TCGCCGACGA GGGCTTCCAC CGCCTCTACG AGGAGGCCTA CGCCGACCTC
TACCAGGACC TCGTCGGGAG CGGCACCGCC GCCGAACTCC TGGAGTCGGC CGTCTCCCGG
GCCGGGGCCA CTGGCGACAC GGGGGCCGAC GCCGCGGGCG AGCAGCTCGC CGAGCGGATC
GCCGCCGTCT CCCCGGCAAT CGAGTGA
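
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before any computation. The following is a minimal sketch of how a GC percentage such as the one reported in the record could be recomputed from the sequence text. The helper names are hypothetical, and the method the database itself uses is not stated here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100 * gc / len(seq)


# Demonstration on the first two ten-base blocks of the gene sequence.
first_blocks = clean_sequence("ATGCGCGGAC TCGATGATGA")
print(len(first_blocks))         # 20
print(gc_percent(first_blocks))  # 55.0
```

Passing the full gene sequence text through `clean_sequence` and then `gc_percent` gives the value for the whole gene.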
 
Protein sequence
MRGLDDEEGG GDTAARDTAA GRRQRGPARL RHRLPVRLRQ HWRSTAVVCA ALLALVLVFG 
EARVRPYVTS ELVSRDAVTQ NIEGEGDLFD GGEHTIEVSF DQTEYADMMS TFREEGEKEY
IRADVTIDGT TVEDVGLRLK GNSTLMSLRG DSGAGAAPGQ GGQQREETGT DTGTGDAGGV
ADGTTGAAAD TRGEQGTGPG GGGAAGPGSV TLSEDEPENL PWLISFEEFV SGRAYQGHTE
IALRPATATS DTALNEALAL ELTAAAGQTT QDYTFTSFAV NGGEGVPRLL LDAPDAAWAD
GYGDGVLYKG RAGGSFDYLG GDATDYEEAF NQINGEGAHD LQPVMDLLRF VAESDDEEFA
RELDEHLDTE SFARYLALQS LMSNSDGMDG PGNNYYLWYD TAQERFTVLS WDLNLSFGGM
GGMGGGGGAM PEGAQFPGGG TPPGGTWPPE GMEAPGGMPA PGEGEAVGGG MSMGGSGALK
ERFLADEGFH RLYEEAYADL YQDLVGSGTA AELLESAVSR AGATGDTGAD AAGEQLAERI
AAVSPAIE
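
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates that relationship, assuming the codon assignments of NCBI translation table 11, which are the same as those of the standard code. It does not handle the alternative start codons that table 11 permits, which does not matter for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative only.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence.
print(translate("ATGCGCGGAC TCGATGATGA GGAGGGCGGC"))  # MRGLDDEEGG
print(translate("GCCGCCGTCT CCCCGGCAAT CGAGTGA"))     # AAVSPAIE
```

The two outputs match the first ten and last eight residues of the protein sequence listed above.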