Gene Ndas_1492 details


Gene Information

Locus tag: Ndas_1492
Symbol:
ID: 9245342
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1828999
End bp: 1830153
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: peptidase S1 and S6 chymotrypsin/Hap
Protein accession: YP_003679428
Protein GI: 297560454
COG category:
COG ID:
TIGRFAM ID:
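
The listed lengths follow from the coordinates above. The sketch below reproduces that arithmetic, assuming the start and end positions are 1-based and inclusive (which matches the 1155 bp figure) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1828999, 1830153)
print(length)                     # 1155
print(protein_length_aa(length))  # 384
```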


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0362438
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGACCCT CCAGCATCAT CTCCGCCGTG GGCACAGGAG CCCTGGCCTT CGGCATGGCA 
CTGGCCATGG CCCCCGGAGC CCTCGCGGCG CCGGCCCCCG TCCCCCAGAC CCCCGTCGCC
GACGACAGCG CCGCCAGCAT GACCGAGGCG CTCAAGCGCG ACCTCGACCT CACATCGGCC
GAGGCCGAGG AGCTGCTCTC GGCGCAGGAA GCCGCCATCG AGACCGACGC CGAGGCCACC
GAGGCCGCGG GCGAGGCCTA CGGCGGTTCC CTGTTCGACA CCGAGACCCT CGAACTCACC
GTGCTGGTCA CCGACGCCTC CGCCGTCGAG GCGGTCGAGG CCACCGGAGC CCAGGCCACC
GTCGTCTCCC ACGGCACCGA GGGCCTGACC GAGGTCGTCG AGGACCTCAA CGGCGCCGAG
GTTCCCGAGA GCGTCCTCGG CTGGTACCCG GACGTGGAGA GCGACACCGT CGTGGTCGAG
GTGCTGGAGG GCTCCGACGC CGACGTCGCC GCCCTGCTCG CCGACGCCGG TGTGGACTCC
TCCTCGGTCC GGGTGGAGGA GACCGAGGAG GCCCCGCAGG TCTACGCCGA CATCATCGGC
GGCCTGGCCT ACTACATGGG CGGCCGCTGC TCCGTCGGCT TCGCCGCGAC CAACAGCGCC
GGTCAGCCCG GTTTCGTCAC CGCCGGCCAC TGCGGCACCG TCGGCACCGG CGTGACCATC
GGCAACGGCA CCGGCACCTT CCAGAACTCG GTCTTCCCCG GCAACGACGC CGCCTTCGTC
CGCGGTACCT CCAACTTCAC CCTGACCAAC CTGGTCTCGC GCTACAACTC CGGCGGCTAC
CAGTCGGTGA CCGGTACCAG CCAGGCCCCG GCCGGCTCGG CCGTGTGCCG CTCCGGCTCC
ACCACCGGCT GGCACTGCGG CACCATCCAG GCCCGCAACC AGACCGTGCG CTACCCGCAG
GGCACCGTCT ACTCGCTCAC CCGTACCAAC GTGTGCGCCG AGCCCGGTGA CTCCGGCGGT
TCGTTCATCT CCGGCTCGCA GGCCCAGGGC GTCACCTCCG GCGGCTCCGG CAACTGCTCC
GTCGGCGGCA CGACCTACTA CCAGGAGGTC ACCCCGATGA TCAACTCCTG GGGCGTCAGG
ATCCGCACCA GCTGA
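
A GC content figure like the one in the Gene Information table can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text in the ten-base blocks shown above; `gc_percent` is a hypothetical helper name, and the call at the end uses only the first block of the sequence as sample input.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First ten bases of the gene sequence only, as a small worked input.
print(gc_percent("ATGAGACCCT"))  # 50.0
```

Passing the full gene sequence to the same function gives the value for the whole gene.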
 
Protein sequence
MRPSSIISAV GTGALAFGMA LAMAPGALAA PAPVPQTPVA DDSAASMTEA LKRDLDLTSA 
EAEELLSAQE AAIETDAEAT EAAGEAYGGS LFDTETLELT VLVTDASAVE AVEATGAQAT
VVSHGTEGLT EVVEDLNGAE VPESVLGWYP DVESDTVVVE VLEGSDADVA ALLADAGVDS
SSVRVEETEE APQVYADIIG GLAYYMGGRC SVGFAATNSA GQPGFVTAGH CGTVGTGVTI
GNGTGTFQNS VFPGNDAAFV RGTSNFTLTN LVSRYNSGGY QSVTGTSQAP AGSAVCRSGS
TTGWHCGTIQ ARNQTVRYPQ GTVYSLTRTN VCAEPGDSGG SFISGSQAQG VTSGGSGNCS
VGGTTYYQEV TPMINSWGVR IRTS
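
The protein sequence can be cross-checked against the gene sequence by translating it with translation table 11. The sketch below builds the codon table from the NCBI base ordering (T, C, A, G) and the table 11 amino acid string. It assumes the gene sequence is given in coding orientation and does not handle the alternative start codons that table 11 permits; this gene starts with ATG, so that case does not arise here. The `translate` helper is illustrative.

```python
from itertools import product

# NCBI ordering: first base varies slowest, each position cycles T, C, A, G.
BASES = "TCAG"
# Amino acid assignments for translation table 11 ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence.
print(translate("ATGAGACCCT CCAGCATCAT CTCCGCCGTG"))  # MRPSSIISAV
print(translate("ATCCGCACCA GCTGA"))                  # IRTS
```

Both outputs match the corresponding ends of the protein sequence listed above.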