Gene Ndas_0122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0122 
Symbol 
ID9243953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp150807 
End bp152243 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content71% 
IMG OID 
Productreplication initiation protein 
Protein accessionYP_003678078 
Protein GI297559104 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.93008 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCACC CCAGCGACGC CACACAGCCC CGTCGCGCAC CCTCCACAGA GCAGATCAGC 
GAACGCGTAG CGACCGGCGC TCTCGACGAC GTGCTCTCCA CCATCAAGCG CGTGCGGGGC
TGCTCTGAGC CGCTGCGGCT GCGCGGACAG CTCTCGACCG TGGACACCGC CACCGGGGCC
TGTGATGCGG TGTGGTCCAC CACCGGCCAA CCCGGACAGG TGCTCATGGT CGCCTGCGGC
AACCGCCGCG CCTCCCGCTG CCCCGCCTGC GCCGACACCT ACCAGGGCGA CACCTTCCAC
CTCATCCGCG CCGGACTCGT CGGCGGAGAC AAGGGCGTCC CCGAAGCCGT GCGCTCCCAC
CCCCGCGTGT TCGCCACCCT GACCGCCCCC AGCTTCGGCC CCGTCCACCG GGGACCGGAC
GCCTCCGGGC GTGCGGTGGT GTGCCACCCC CGCCGCTCGG GTGCGGCCTG CTACCGCCGC
CACACCGCAG ACGATGCCCT CATCGGCCAA CCCTTGGACG CGGAGAGCTA CGACTACGAC
GGGCACGTGC TGTGGAACAA CCACGCCGGG GACCTGTGGA GCCGCTTCAC GGTCTACCTG
CGCCGCCACC TGGCCGACGC CGCCGGGATC GGACGCACCG AGTTCAACCG CACGGTGCGG
GTGTCCTACG CCAAGGTGGC CGAGTTCCAA GCCCGGGGGC TGGTCCACTT CCACGCCGTG
ATCCGCCTGG ACACCAAACG CCCTGACGGC ACCGTGGAAC CTCCCCCGGC CTGGGCGTCG
GTGGAGCTGC TCACCGCCGC CATTCGTTCC GCCGCTGCGG CCGTGGTGGT CCCGGCCGAG
ACCGCCAACG GTTCCCGGTT CCTGTCCTGG GGTGAACAGG TGGACGTCCA CGCGATCACC
TCGGGTGCGT TCGCCTCCGG CGGGGTGGAT GAGGAAGCGG TGGCCGCCTA CATCGCCAAG
TACGCCACCA AGTCCACTAC CGATGACGGC ACCCTGGACC GGCGCGTGTT CGCCGGGGCT
CCGCTGGACC ACCTGGGGTT GAGCGACCAC CAGCGCAGGT TGATCCTGAC CTGCTGGCGT
CTGTCCGAGG TCCCCGGCCT GGAGGAGCGC AAGCTCGACC GGTGGGCGCA CACCCTCGGG
TTTCGGGGCC ACTTCTCCAC CAAGTCGCGC CGCTACTCCA CCACCCTGGG CCAACTTCGG
CAGGTGAGGC GGGATTTCCG CGCCGGGCAG GCACGCGCGA TGGGTCATGA CGACCTGCTC
GGCGACCTGC CCGAGATGAC CGAGGACACC ACGCTCGTAG TCGGCTCGTT CTCCTACGCC
GGGCAGGGCT ACGCACACCC CGTTGACCGG TGGCTGGCCG AGTCCCACCA CCGCAGCCGG
GTCTACAGCC GCCGCGTGGG ACGCGAACAG CTCGCAGACC TGGAAGAGGC CGCCTGA
 
Protein sequence
MPHPSDATQP RRAPSTEQIS ERVATGALDD VLSTIKRVRG CSEPLRLRGQ LSTVDTATGA 
CDAVWSTTGQ PGQVLMVACG NRRASRCPAC ADTYQGDTFH LIRAGLVGGD KGVPEAVRSH
PRVFATLTAP SFGPVHRGPD ASGRAVVCHP RRSGAACYRR HTADDALIGQ PLDAESYDYD
GHVLWNNHAG DLWSRFTVYL RRHLADAAGI GRTEFNRTVR VSYAKVAEFQ ARGLVHFHAV
IRLDTKRPDG TVEPPPAWAS VELLTAAIRS AAAAVVVPAE TANGSRFLSW GEQVDVHAIT
SGAFASGGVD EEAVAAYIAK YATKSTTDDG TLDRRVFAGA PLDHLGLSDH QRRLILTCWR
LSEVPGLEER KLDRWAHTLG FRGHFSTKSR RYSTTLGQLR QVRRDFRAGQ ARAMGHDDLL
GDLPEMTEDT TLVVGSFSYA GQGYAHPVDR WLAESHHRSR VYSRRVGREQ LADLEEAA