Gene Ndas_1493 details


Gene Information

Locus tag: Ndas_1493
Symbol:
ID: 9245343
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1830963
End bp: 1832108
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: peptidase S1 and S6 chymotrypsin/Hap
Protein accession: YP_003679429
Protein GI: 297560455
COG category:
COG ID:
TIGRFAM ID:
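
The start, end, and length fields of this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length (the gene sequence in this record ends in TAA). The function name `expected_lengths` is hypothetical and is not part of any database API.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Coordinates taken from this record.
print(expected_lengths(1830963, 1832108))  # (1146, 381)
```

Under these assumptions the result agrees with the listed Gene Length (1146 bp) and Protein Length (381 aa).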


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.03081
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGACCCT CCCCCGCTAT CTCCGCTATC GGCACCGGCG CACTCGCGTT CGGTCTGGCG 
TTCTCCGTGA CGCCGGGCGC CAGTGCGGCG ACCGTACCGG CCGAGCCAGC GAGCGAGGCC
CAGACGATGA TGGAAGCGCT GCAGAGAGAC CTCGGCCTCA CCCCGCTCGG GGCCGAGGAG
CTGCTCTCGG CGCAGGAAGA GGCGATCGAG ACCGACGCCG AGGCCACCGA GGCCGCGGGA
GCGTCCTACG GCGGCTCCCT GTTCGACACC GAGACCCTCC AGCTCACCGT GCTGGTGACC
GACGCCTCGG CCGTCGAGGC GGTGGAGGCC ACCGGCGCCG AGGCCACCGT GGTCTCACAC
GGCACCGAGG GCCTGGCCGA GGTGGTCGAC GCGCTCGACG AGACCGGCGG CCGGGAAGGG
GTCGTCGGCT GGTACCCGGA TGTGGAGAGC GACACCGTCG TGGTCCAGGT CGCCGAGGGC
GCCAGCGCCG ACGGCCTCAT CGAGGCCGCG GGCGTGGACC CCTCCGCCGT CCGGGTGGAG
GAGACCAGCG AGACGCCGCG CCTGTACGCC GACATCGTCG GCGGCGAGGC GTACTACATG
GGCGGCGGAC GCTGCTCGGT CGGGTTCGCC GTGACCGACG GTTCCGGCGC GGGCGGCTTC
GTGACGGCGG GCCACTGCGG CACCGTCGGC ACCGGCGCCG AGAGCTCCGA CGGCAGCGGA
TCCGGAACCT TCCAGGAGTC CGTCTTCCCG GGCAGCGACG GCGCCTTCGT CGCGGCCACC
TCCAACTGGA ACGTGACCAA CCTGGTCAGC CGGTACGACT CCGGCAGCCC CCAGGCGGTG
TCGGGTTCCA GCCAGGCCCC GGAGGGCTCG GCGGTGTGCC GCTCCGGCTC CACCACCGGC
TGGCACTGCG GGACCATCGA GGCCCGCGGC CAGACGGTGA GCTACCCGCA GGGCACGGTC
CAGGACCTGA CCCGGACGGA CGTGTGCGCC GAGCCCGGTG ACTCCGGCGG CTCGTTCATC
GCCGGTTCGC AGGCCCAGGG CGTCACCTCC GGCGGCTCGG GCAACTGCAC TTCCGGCGGC
ACGACCTACT ACCAGGAGGT CACTCCCCTG CTGAGCAGCT GGGGGCTGTC CCTGGTGACC
GGGTAA
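
A GC content figure such as the one listed under Gene Information can be recomputed from a sequence printed in space-separated blocks of ten bases. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over the full sequence length. The helper name `gc_percent` is hypothetical.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence.

    Whitespace (block separators, line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence from this record, as an illustration only;
# the full sequence would be needed to compare with the listed GC content.
first_line = "ATGCGACCCT CCCCCGCTAT CTCCGCTATC GGCACCGGCG CACTCGCGTT CGGTCTGGCG"
print(round(gc_percent(first_line), 1))
```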
 
Protein sequence
MRPSPAISAI GTGALAFGLA FSVTPGASAA TVPAEPASEA QTMMEALQRD LGLTPLGAEE 
LLSAQEEAIE TDAEATEAAG ASYGGSLFDT ETLQLTVLVT DASAVEAVEA TGAEATVVSH
GTEGLAEVVD ALDETGGREG VVGWYPDVES DTVVVQVAEG ASADGLIEAA GVDPSAVRVE
ETSETPRLYA DIVGGEAYYM GGGRCSVGFA VTDGSGAGGF VTAGHCGTVG TGAESSDGSG
SGTFQESVFP GSDGAFVAAT SNWNVTNLVS RYDSGSPQAV SGSSQAPEGS AVCRSGSTTG
WHCGTIEARG QTVSYPQGTV QDLTRTDVCA EPGDSGGSFI AGSQAQGVTS GGSGNCTSGG
TTYYQEVTPL LSSWGLSLVT G
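
The protein sequence can be related to the gene sequence by translating codon by codon. The sketch below assumes translation table 11, whose amino acid assignments for internal codons are the same as the standard code. It does not handle alternative start codons (which table 11 allows and which are read as Met at the first position), because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a coding DNA sequence, dropping one trailing stop codon."""
    dna = "".join(seq.split()).upper()  # remove block separators
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence from this record.
print(translate("ATGCGACCCT CCCCCGCTAT CTCCGCTATC"))  # MRPSPAISAI
```

The ten residues produced match the first block of the protein sequence above.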